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The review of research related to reinforcement presented in the 
Section I of this report is based largely on studies of subhuman subjects, 
but enthusiasm for applying the knowledge thus derived to problems of 
education has not been accompanied by experimentation to demonstrate that 
such knowledge is applicable. The studies which follow in this section 
of the report are explorations of the applicability of reinforcement 
concepts to the planning of human learning situations similar to those 
that occur in schools. 

The general purposes of these studies were the following: 

1. To relate the degree of acquisition of the task to reinforcing 
conditions in situations similar to those occurring in the 
classroom. 

,2. To determine the amount of learning which takes place when 

one pupil observes another pupil in a recitation type situation. 

3. To determine the effectiveness of various reinforcing state- 
ments on the part of the teacher in a recitation type of 

s i tuat i on. 

4. To compare reinforcements provided by written materials with 
the pupil working by himself with reinforcements provided in 
recitat i on-type situations. 

5. To compare the effectiveness of learning situations in which 
the teacher provides the reinforcement in comparison with 
situations in which pupils reinforce the work of pupils. 

6. To compare the effects of pupi.ls.6f different achievement levels 
functioning in teacher roles in relation to other pupils. 

7. To relate the effectiveness of the teacher's reinforcements 
to the distance of the teacher from the pupil. 
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These are the broad questions for which answers were sought, but 
more specific questions will be asked at the beginning of each section 
of the report that follows. 



Method 



The Task 

The first step which had to be undertaken in the development of 
this study was the construction of a learning task which (a) could be 
administered in a learning situation conforming to the response- 
reinforcement learning model, (b) be similar to learning tasks commonly 
observed in classrooms, (c) be such that it could be administered under 
several different learning and reinforcement conditions, and (d) permit 
a reliable measure of learning. The task selected was that of learning 
the meaning of 60 German words. 

The learning task consisted of a German stimulus word which in the 
experiment was to be correctly associated with one of two English noun 
responses. German was selected because it was assumed that there would 
be few children with any familiarity with this language, and in addition, 
that interest in learning foreign words would be high. 

The general format of the task required a subject to guess which 
one of two English words meant the same as a given German word. Rein- 
forcement followed the response. These pairs of English nouns (one of 
which was the equivalent of the German stimulus) had to be selected so 
that one could assume that they had equal association to the German 
stimulus. In providing a pool of such words, all nouns of three to 
eight letters in length (not having some obvious association with the 



o 

ERIC 



1.04 



1 1 



German word) were taken from the Thorndi ke-Lorge (1944, appendix Part 
V) list of “Original Thorndike Words." The pairs were selected separately 
for the first 500 most frequently occurring and for the second 500 most 
frequently occurring words. An additional seven were taken from the list 
of 110 appended to the original 1000, which yielded a total of 314 words 
from which the task could be assembled. 

On the basis of an argument advanced by Underwood and Schulz (I960), 
a second step in providing for equal association value was the assign- 
ment of a numerical frequency value for the first bigram of each selected 
word, i.e., (ap)ple. In the pairing of response choices the bigrams of 
the two words had to have approximately comparable frequencies (as deter- 
mined by the compilers, Underwood and Schulz). 

In addition to the procedures already outlined, each English noun 
was checked for level of readability to insure that words used were 
well within the reading capacity of participating Ss. Durrell, in his 
Appendix A of Improving Reading Instruction (1956) , 1 ists 754 words com- 
piled at the primary grade level as vocabulary for older children in 
remedial reading programs. Each word has been assigned a value ranging 
from 1 to 7 which indicates frequency of occurrence in the literature of 
small children. It was found that 69 per cent of the response nouns in 
the learning task are listed among Durrell 's words. As far as was possibl 
pairs of response choices were matched with respect to Durrell 's read- 
ability values. Those nouns already selected for the task which were 
not part of the Durrell list were checked against another Durrel 1 list 
of vocabulary to be learned in the fourth grade. While not many of the 
learning task nouns were found there, it was obvious that Durrel I's 
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fourth grade list was at a higher level of difficulty. It was concluded 
that the words selected would be well within the range of reading ability 
for the proposed sample of subjects. 

The task was prepared in two different forms. In one forms suitable 
for administration in a group situation, each German word and the accom- 
panying English words were printed in large block letters on sheets of 
card 1 1 X l4 inches. In another form, the materials were typed on paper 
suitable for insertion into a teaching machine (Koncept-o-graph No. 7). 

The acquisition trials were conducted on each of three days: Monday, 

Tuesday and Wednesday. On each of these days a set of twenty words was 
presented with four trials on each. The total of eighty presentations 
per day required approximately forty minutes to complete. The criterion 
measure was not administered until Friday. The test called for recogni- 
tion of the English equivalent among four choices. The three distractors 
included the words used as an incorrect choice in the learning task and 
also two English words derived from other task items. 

The German words and the characteristics of the English equivalent 
and the alternative response word are given in Table 1.01. The order 
in which the words were given and the positions of the correct and 
incorrect alternatives are given in Table 1.02. 

Measurement of Acqu i s i t i on 

Throughout the series of studies which follow, the three main 

« 

sections of the learning task were administered on Monday, Tuesday and 
Wednesday, and the pupils were tested for retention Friday of the same 
week. Originally the plan had been to use a recall test in which the 
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Table l.Ol 

LEARNING TASK— SHOWING THE FREQ.UENCY VALUES 
USED AS THE BASIS FOR EQUATING 
RESPONSE CHOICES 



r .. TASK A— MONDAY 

Stimulus Responses 



(4) 1098 

1. TEIL part 

(4) 1 1 29 

home 

( 3 ) 1028 

2. BAUM tree 

(-) ni7 

corn 

*(-) 700 

3. RAD wheel 

(5) 852 

money 

(4) 1213 

4. HUETE today 

(4) 1271 

1 ight 

( 7 ) 710 

5. SPIESE food 

(3) .761 

bear 

( 3 ) 234 

6. VOGEL bird 

(-) 205 

ki ng 

(6) 404 

7. LEIB body 

(3) 397 

door 



(4) 1028 

8. ZUG train 

(4) 913 

table 

( 6 ) 3473 

9. VORRAT store 

(-) 3473 

storm 





*(6) 


2794 


10. BEIN 


leg 






(5) 

ear 


2516 




(5) 


1055 


1 1 . MUTZE 


cap 






(3) 

car 


1055 




(-) 


1271 


12. RAND 


lip 






(6) 

hat 


1396 




(5) 


76 1 


13. G LOCKE 


bel 1 




(-) 

fish 


721 




(2) 


404 


14. KNABE 


boy 






(5) 

sun 


598 







( 5 ) 


1213 


15. 


GIPFEL ! 


top 








( 2 ) 


1210 






man 








( 7 ) 


413 


16. 


UFER 


bank 








(3) 


4 o 4 






book 








(5) 


1003 


17. 


BERG 


hill 








( 5 ) 


1057 






side 








*( 5 ) 


1721 


18. 


STRASSE 


road 








( 7 ) 


1689 






news 








( 5 ) 


420 


19. 


GESICHT 


fact 






w 


413 






bal 1 








( 7 ) 


1721 


20 . 


FELSEN 


rock 








( 5 ) 


1488 






rain 





o 






•MM 
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Table 1.01 (Continued) 



TASK B— TUESDAY 

Stimulus Responses 







*(-) 


2389 






(-) 


66 






(-) 


3902 


1. 


KUNST 


art 




8. 


El CHE 


oak 




15. 


ZOLL 


i nch 








(5) 


2389 






(-) 


113 






(7) 


2497 






arm 








oil 








tire 








*(6) 


1865 






(5) 


413 






(5) 


398 


2. 


MEER 


sea 




9. 


TASCHE 


bag 




16. 


SPIEL 


game 








(-) 


1721 






(7) 


420 






(7) 


495 






row 








fat 








i dea 








*(-) 


111 






■■'(2) 


342 






(-) 


404 


3. 


HIMMEL 


sky 




10. 


TAG 


day 




17. 


KNOCHEN 


bone 








(4) 


295 






(-) 


404 






(4) 


404 






egg 


' 






bow 








boat 








(7) 


459 






(3) 


2897 






(4) 


846 


4. 


TASSE 


cup 




11. 


KOPF 


head 




18. 


STADT 


town 








(-) 


588 






(6) 


2516 






(9 


720 






age 








east 








mi le 








(6) 


829 






*(2) 


2497 






(-) 


913 


5. 


LUFTEN 


ai r 




12. 


ZEIT 


time 




19. 


SCHWANZ 


tai 1 








(7) 


920 






(-) 


2897 






(7) 


1061 






son 








heat 








i ron 








*(-) 


1307 






(5) 


96 






(7) 


1129 


6. 


GESETZ 


1 aw 




13. 


HOF 


yard 




20. 


STUNDE 


hour 








(-) 


1309 






(-) 


88 






(7) 


1271 






i ce 








knee 








1 ife 








*(6) 


723 






(6) 


598 










7. 


KRIEG 


war 




14. 


FOLGE 


suit 
















(-) 


761 






(7) 


566 











mmmrnm 
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Table 1.01 (Continued) 



Stimulus 


Response 




TASK C— 


WEDNESDAY 












w 


1721 






(5) 920 






(-) 648 


1. 


ZIMMER 


room 




8. 


LIED 


song 


15. 


WOLKE 


cioud 






(-) 


1396 






(6) 1203 






(-) 578 






hall 








ship 






ocean 






w 


1271 






(-) 122 






(7) 90 


2. 


ZEILE 


1 i ne 




9. 


STIMME 


voice 


16. 


RAUCH 


smoke 






(3) 


1396 






(5) 148 






(6) 148 






hand 








f 1 oor 






music 






(5) 


288 






(-) 761 






*(-) 707 


3. 


HOLZ 


wood 




10. 


TIER 


beast 


17. 


RAUM 


space 






(5) 


338 






(-) 700 






(2) 723 






wi nd 








wheat 






water 






(2) 


779 






(-) 1297 






(-) 1028 


4. 


BRUNNEN 


wel 1 




11 . 


HOCHST 


chief 


18 . 


WAHRHEiT 


truth 






w 


779 






(3) 1297 






(-) 1129 






week 








chair 






honor 






(7) 


1028 






(7) 648 






(-) 818 


5. 


AUSFLUG 


trip 




12„ 


TUCH 


cl oth 


19. 


MACHT 


power 






(-) 


1054 






(-) 648 






(7) 703 






1 ord 








clock 






appie 






(-) 


2389 






*(-) 712 






(3) 2879 


6. 


HEER 


army 




13. 


ZAUN 


fence 


20. 


SACHE 


thing 






(7) 


3473 






(5) 723 






(6) 2897 






Star 








watch 






heart 






(5) 


1 129 






(7) 1098 








7. 


LOCH 


hole 




14. 


FARBE 


paint 












(-) 


1117 






(-) 1117 












coal 








court 









Note: 



1. Response words are from “Original Thorndike Words" 1-1000. 

* Items where the response did not originate in the same group of 500. 

2. Values in parenthesis () 1-7 are from Durrell primary- reading- 
level frequencies. Number 1 indicates highest frequency. 

3. Large number values are initial bigram frequencies (Underwood & 
Schulz) . 



o 
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Table 1.02 

ORDER IN WHICH ITEMS WERE PRESENTED TO SUBJECTS* 



Order of 
Items on 
1st aftd 3rd 
Presentat i on 


Pos i t ion 
of Correct 
R on Card 


TASK A— MONDAY 

Order of 
S R Items on 

2nd and 4th 
Presentat i on 


Position 
of Correct 
R on Card 


Subject 
Seating 
Pos i t ion 


1 




bottom 


Teil 


part 


30 


top 


4 


2 




top 


Baum 


tree 


29 


bottom 


2 


3 




bottom 


Rad 


wheel 


28 


top 


1 


4 




bottom 


Heute 


today 


27 


top 


3 


5 




top 


Speise 


food 


26 


bottom 


2 


6 




top 


Vogel 


bird 


25 


bottom 


4 


7 




bottom 


Lei b 


body 


24 


top 


3 


8 




top 


Zug 


train 


23 


bottom 


1 


9 




bottom 


Vorrat 


store 


22 


top 


1 


10 




bottom 


Be i n 


leg 


21 


top 


2 


11 




top 


Mutze 


cap 


40 


bottom 


3 


12 




top 


Rand 


lip 


39 


bottom 


4 


13 




bottom 


G1 ocke 


bell 


38 


top 


4 


14 




top 


Knabe 


boy 


37 


bottom 


2 


15 




bottom 


Gipfel 


top 


36 


top 


1 


16 




top 


Ufer 


bank 


35 


bottom 


3 


17 




top 


Berg 


hill 


34 


bottom 


3 


18 




bottom 


St rasse 


road 


33 


top 


2 


19 




top 


Ges icht 


face 


32 


bottom 


1 


20 




bottom 

_ _ 


Pel sen 


rock 


31 


top 


4 



*See explanatory note bottom of next page. 



o 
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Table 1.02 (Continued) 



Order of 
Items on 
1st and 3rd 
Presentation 


TASK 

Position S 

of Correct 
R on Card 


B— TUESDAY 

Order of 
R Items on 

2nd and 4th 
Presentat i on 


Posi ti on 
of Correct 
R on Card 


Subject 
Seat i ng 
Position 


1 




top 


Kunst 


art 


30 




bottom 


3 


2 




bottom 


Meer 


sea 


29 




top 


4 


3 




bottom 


H i mme 1 


sky 


28 




top 


4 


4 




top 


Tasse 


cup 


27 




bottom 


2 


5 




bottom 


Luf ten 


ai r 


26 




top 


1 


6 




top 


Gesetz 


law 


25 




bottom 


3 


7 




top 


Krieg 


war 


24 




bottom 


3 


8 




bottom 


Eiche 


oak 


23 




top 


2 


9 




top 


Tasche 


bag 


22 




bottom 


1 


10 




bottom 


Tag 


day 


21 




top 


4 


11 




bottom 


Kopf 


head 


40 




top 


4 


12 




top 


Zeit 


t ime 


39 




bottom 


2 


13 




bottom 


Hof 


yard 


38 




top 


1 


14 




bottom 


Folge 


su i t 


37 




top 


3 


15 




top 


Zol 1 


i nch 


36 




bottom 


2 


16 




top 


Spiel 


game 


35 




bottom 


4 


17 




bottom 


Knochen 


bone 


34 




top 


3 


18 




top 


Stadt 


town 


33 




bottom 


1 


19 




bottom 


Schwanz 


tail 


32 




top 


1 


20 




top 


Stunde 


hour 


31 


_g_ m 


bottom 


2 



tions). Note that item 10, Bein, also becomes item 21, and is presented to 
the subject at position 2, Forty presentations are shown here. The same 
order is repeated once so that there are eighty presentations in all. 
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Table 1.02 (Continued) 



Order of 
Items on 
1st and 3rd 
Presentat i on 


Position 
of Correct 
R on Card 


TASK C- 
S 


-WEDNESDAY 

Order of 
R Items on 

2nd and 4th 
Presentat i on 


Posit ion 
of Correct 
R on Card 


Subject 
Seat i ng 
Pos i t ion 


1 




bottom 


Z i mme r 


room 


30 




top 


4 


2 




top 


Zei 1e 


1 i ne 


29 




bottom 


2 


3 




bottom 


Holz 


wood 


28 




top 


1 


4 




bottom 


Brunnen 


wel 1 


27 




top 


3 


5 




top 


Ausf lug 


trip 


26 




bottom 


2 


6 




top 


Heer 


army 


25 




bottom 


4 


7 




bottom 


Loch 


hole 


24 




top 


3 


8 




top 


Lied 


song 


23 




bottom 


1 


9 




bottom 


St imme 


voice 


22 




top 


1 


10 




bottom 


Tier 


beast 


21 




top 


2 


11 




top 


Hochst 


ch ief 


40 




bottom 


3 


12 




top 


Tuch 


cloth 


39 




bottom 


4 


13 




bottom 


Zaun 


fence 


38 




top 


4 


14 




top 


Farbe 


paint 


37- 




bottom 


2 


15 




bottom 


Wol ke 


cloud 


36 




top 


1 


16 




top 


Rauch 


smoke 


35 




bottom 


3 


17 




top 


Raum 


space 


34 




bottom 


3 


18 




bottom 


Wahrhe it 


truth 


33 




top 


2 


19 




top 


Macht 


power 


32 




bottom 


1 


20 




bottom 


Sache 


thing 


31 




top 


4 
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pupils would be given the German words and be required to enter the 
English equivalent. However, such a test was found to be much too 
difficult in relation to the time devoted to learning, so a recogni- 
tion test was developed which gave each German word followed by four 
English words. The pupil was required to choose the correct English 
alternative from the four words given. The four words included the 
pair that had been originally presented with the German word and two 
additional words were added from other items in the learning task. 

The reliability for the score derived from the test was estimated by 
means of the Kuder-Richardson Formula 21 and was found to be 0.72 
(N=l80). This estimate was derived from the cases included in the 
first study to be reported. 

A copy of the recognition test is shown in Table 1.03. The three 
parts of the test were administered in the same order one after the 
other. The test was administered by a member of the staff of the pro- 
ject who read aloud to the pupils the German word and the four English 
words for each item and then allowed time for the pupils to choose an 
answer before moving on to the next item. Pupils were urged to attempt 
each item and very few of the pupils skipped any items. The test was 
scored for the number of correct responses. 
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Table 1.03 



RECOGNITION TEST A (MONDAY'S TASK) ADMINISTERED FRIDAY 





KNABE 




RAND 




SPEISE 




1. 


’ KING 


1. 


’ HAT 


1. 


’ FOOD 


1. 


2. 


BOY 


2. 


NIGHT 


2. 


BED 


2. 


3. 


DOOR 


3. 


TABLE 


3. 


FISH 


3. 


4. 


SUN 


4. 


LIP 


4. 


BEAR 


4. 




GIPFEL 




BERG 




VOGEL 




TT 


' MAN 


TT 


' PEN 


TT 


’ EYE 


IT" 


2. 


SIDE 


2. 


RAIN 


2. 


JOB 


2. 


3. 


TOP 


3. 


HILL 


3. 


KING 


3. 


4. 


LIGHT 


4. 


SIDE 


4. 


BIRD 


4. 




MUTZE 




UFER 




LEIB 




TT 


' HOUSE 


TT 


’ BANK 


TT 


’ WIFE 


TT 


2. 


CAP 


2. 


BOX 


2. 


BODY 


2. 


3. 


NEWS 


3. 


BOY 


3. 


DOOR 


3. 


4. 


CAR 


4. 


BOOK 


4. 


DATE 


4. 




BEIN 




STRASSE 




BAUM 




TT 


LEG 


TT 


' COW 


TT 


' MAN 


TT" 


2. 


EAR 


2. 


NEWS 


2. 


HOME 


2. 


3. 


BED 


3. 


CORN 


3. 


CORN 


3. 


4. 


TIE 


4. 


ROAD 


4. 


TREE 


4. 




GESICHT 




FELSEN 




TEIL 




TT 


BALL 


TT 


RAIN 


TT 


PART 


TT 


2. 


FACE 


2. 


TODAY 


2. 


HILL 


2. 


3. 


PLACE 


3. 


ROCK 


3. 


WHEEL 


3. 


4. 


GIFT 


4. 


CAP 


4. 


HOME 


4. 



GLOCKE 

LEG 

ADD 

BELL 

FISH 

ZUG 

TABLE 

HAT 

TRAIN 

TREE 

HEUTE 

LIGHT 

FOOD 

TODAY 

NIGHT 

VORRAT 

END 

STORM 

STATE 

STORE 

RAD 

WHEEL 

BODY 

MONEY 

PIECE 
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Table 1.03 (Continued) 

RECOGNITION TEST ^ (TUESDAY'S TASK) ADMINISTERED FRIDAY 





LUFTEN 




KUNST 




KOPF 




SPIEL 


1. 


‘ AIR 


1. 


’ ARM 


1. 


’ ART 


1. 


DAY 


2. 


WAR 


2. 


ART 


2. 


HEAD 


2. 


BOW 


3. 


SON 


3. 


ADD 


3. 


TIME 


3. 


GAME 


4. 


BABY 


4. 


AIR 


4, 


EAST 


4. 


IDEA 




TAG 




TASCHE 




ZEIT 




ZOLL 


TT 


’ KNEE 


T7 


■ FAT 


TT 


' TIME 


TT 


■ TIRE 


2. 


BOW 


2. 


IDEA 


2. 


LEG 


2. 


STATE 


3. 


GAME 


3. 


BAG 


3. 


HEAT 


3. 


STORY 


4. 


DAY 


4. 


BOAT 


4. 


TIE 


4. 


INCH 




MEER 




TASSE 




STUNDE 




FOLGE 


TT 


' ROAD 


TT 


■ CUP 


TT 


IRON 


TT 


‘ BOAT 


2. 


SEA 


2. 


AGE 


2. 


LIFE 


2. 


BONE 


3. 


NEWS 


3. 


SKY 


3. 


TAIL 


3. 


SAIL 


4. 


ROW 


4. 


SAIL 


4. 


HOUR 


4. 


SUIT 




KRIEG 




El CHE 




STADT 




SCHWANZ 


TT 


’ TOWN 


TT 


■ OAK 


TT 


' WHEEL 


TT 


■ HOUR 


2. 


BELL 


2. 


EGG 


2. 


TABLE 


2. 


TAIL 


3. 


BEE 


3. 


OIL 


3. 


TOWN 


3. 


IRON 


4. 


WAR 


4. 


YARD 


4, 


MILE 


4. 


LIFE 




GESETZ 




HIMMEL 




KNOCHEN 




HOF 


TT 


’ KNEE 


TT 


‘ OAK 


TT 


' BOAT 


TT 


' YARD 


2. 


IRON 


2. 


OIL 


2. 


BONE 


2. 


KNEE 


3. 


LAW 


3. 


EGG 


3. 


BOX 


3. 


SKY 


4. 


ICE 


4. 


SKY 


4. 


FARM 


4. 


EGG 
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Table 1.03 (Continued) 



RECOGNITION TEST C (WEDNESDAY'S TASK) ADMINISTERED FRIDAY 



HOLZ 

1 . SMOKE 

2. MUSIC 

3. WOOD 

4. WIND 

BRUNNEN 

1 . WELL 

2. WEEK 

3. CLOTH 

4. FENCE 

ZEILE 

1 . HAND 

2. RAIN 

3. LINE 

4. PEN 

ZIMMER 

1. CHAIR 

2. HALL 

3. TRIP 

4. ROOM 

HEER 

1 . STAR 

2. ARMY 

3. HEART 

4. THING 



LOCH 

1 . HOLE 

2. COAL 

3. PAINT 

4. COURT 

LIED 

1 . CLOCK 

2. SHIP 

3. CLOUD 

4. SONG 

AUSFLUG 

1. TRIP 

2. LORD 

3. CHIEF 

4. TRUTH 

STIMME 

1 . SMOKE 

2. VOICE 

3. MUSIC 

4. FLOOR 

SACHE 

1 . STAR 

2. HEART 

3. THING 

4. ARMY 



MACHT 

1. APPLE 

2. POWER 

3. BEAST 

4. WHEAT 

TIER 

1 . BEAST 

2. WHEAT 

3. OCEAN 

4. CLOUD 

HOCHST 

1 . HONOR 

2. CHAIR 

3. TRUTH 

4. CHIEF 

TUCH 

1 . CLOCK 

2. SHIP 

3. CLOTH 

4. SONG 

WOLKE 

1 . OCEAN 

2. MONEY 

3. WHEEL 

4. CLOUD 





ZAUN 


1. 


’ SPACE 


2. 


FENCE 


3. 


WATCH 


4. 


WATER 




FARBE 


IT 


’ COAL 


2. 


PAINT 


3. 


COURT 


4. 


HOLE 




RAUCH 


IT 


SMOKE 


2. 


FLOOR 


3. 


VOICE 


4. 


MUSIC 




RAUM 


TT 


WELL 


2. 


WEEK 


3. 


WATER 


4. 


SPACE 




WAHRHEIT 


TT 


LINE 


2. 


HONOR 


3. 


TRUTH 


4. 


HAND 
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CHAPTER II 



A COMPARISON OF LEARNING UNDER DIRECT REINFORCEMENT 



WITH VICARIOUS REINFORCEMENT 



2.01 



In Part I of this report a review was made of studies of learning in 
which one organism observes another organism learn. The learning which 
occurs under this condition has been named vi ca r ious learning in order to 
distinguish it from learning occurring under more typical laboratory con“ 
ditions in which the learner responds and receives some kind of direct 
reinforcement or feedback. In the studies that follow the two learning 
conditions are referred to as the vicarious conditinn and the direct 
condition. The learning that results is referred to as vi carious learning 
and d i rect learning. 

In the review of the literature, evidence was presented which sub- 
stantiates the position that vicarious learning occurs in primates but 
there is some doubt whether it is a phenomenon in simpler organisms. 
Generally it appears to provide a slower form of learning than direct 
learning. The central problem to be investigated in this study is the 
extent to which vicarious learning in young human learners is comparable 
to learning by a direct reinforcement method. The problem is an important 
one in that much of the work of a typical class in an American elementary 
school is undertaken by a recitation method in which some pupils learn by 
direct reinforcement while others learn by a vicarious process. In the 
study that follows, learning is investigated under conditions that simulate 
closely those found in a recitation situation in a typical classroom, in 
addition, the learning data derived from such a situation will be compared 
with data derived from situations in which the pupil is either learning 
alone or is learning while interacting with a single adult. The experiment 
permits the comparison of the effects of a number of different reinforcing 
conditions . 
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The Re! nforcl ng Condi t Ions 

The study Included four main reinforcing conditions, two Involving a 
group of pupils and two Involving only a single pupil. First let us 
consider the two conditions involving pupils working in groups which will 
be referred to as Conditions 1 and 2. 

The physical characteristics of the experimental arrangement were 
Intended to be suggestive of a classroom situation. A ’’teacher ~ 
experimenter faced eight "pupi T’-subjects who were seated in a somewhat 
familiar classroom row (see Figure 2;01). 

1 . : Direct Reinforcement Condi 1 1 on . —Four of the group ^s were 
separately and directly interacted with by the experimenter. Each of the 
four Ss was presented an equal proportion of I terns from the total task. 

One subject at a time (designated D, and numbered 1, 2, 3, or 4 Figure 
2.01) was cal led upon by name, asked to respond to a stimulus word, and given 
feedback as to the correctness of the response. For example, the experi- 
menter might say, "John, this word is Baum" (the German stimulus word). 

The subject , Jbhh, would then select one of the two English alternatives 
as the equivalent of Baum. If the ^ said "tree," the correct choice, the 
experimenter would say, "right" or "that's right." When he was wrong the 
experimenter said nothing. The remainder of the group listened to the 
exchange. The four actively involved Ss will hereafter be referred to 
as direct reinforcement subjects. 

There were four presentations of each card which contained a stimulus 
and two response choices. All four presentations were always given to the 
same subject. The "nature of the task was such that only one S at a time 
could be directly involved in responding and receiving feedback. This 
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Fig. 2.01 Arrangement of teacher and pupils in the experimental 
recitation situation. The pupils marked D are those that interacted 
directly with the teacher. The V pupils did not interact with the 
teacher and learned vicariously. 
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meant that three of the four would be indirectly, or vicariously involved 
in learning an item which was administered to the fourth. 

The learning condition for the D subjects can be divided into two 
conditions (a) learning on those items to which they responded directly 
(1/4 of items), and (b) learning on those items to which the subjects 
did not respond directly (3/4 of the items). The same ^s serve for both 
treatments. Each ^ responds directly to 1/4 of the items, and listens to 
the other three ^s respond to the remaining 3/4. 

2. Vicarious Reinforcement Condition . — To complete the group of eight, 
four additional ^s were seated alternately to those mentioned above. 

These are the subjects marked with Vs in Figure 2.01. These were not at 
any time directly interacted with. Whatever learning was to occur for them, 
had to occur as a result of observing; that is, hearing the response of a 
direct^ and the consequent feedback of the experimenter. This vicarious 
treatment (all of the items learned vicariously) differs from 3/4 vicarious 
above in that the V subjects are never called upon and thus may develop an 
expectancy of not being called upon at all. 

3. Isolation Condition-Working Alone . A group of four subjects was 
used to represent a "working alone" condition. Subjects were seated before 
a "teaching machine" (Koncept-O-Graph , Model 1 7) from which visual feedback 
was afforded. Responses to the same task described earlier were made 
Internally, not vocally or written. The subject simply advanced the machine 
by hand to the problem, subvocal ly responded, advanced the machine to the 
answer, and repeated the process for the entire task. The type of material 
which composed the task was not of the sort usually used in a teaching 
machine in that it was a non- prog rammed rote learning task. 
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I so 1 3 1 1 on Cond 1 1 1 on With Aura 1 Feedback . Thi s treatment was 
identical to the one just described, with the exception that the experi- 
menter sat nearby and gave vocal feedback as soon as the ^ had announced 
his response. If the S. gave the correct response, the experimenter would 
say, ’right," "that's right," or "correct." If the response was incorrect, 
the experimenter would say nothing. 



The Sample 

Those who participated as subjects in the experiment were 180 
elementary school children (91 males and 89 females). All ^s were naive 
as to the purposes of the study; however, one ^ was removed because she 
had some knowledge of German gained through a; mother of German birth. 

The participants were assigned to particular treatments on the basis 
of a reading score. Measures of reading ability, as well as intelligence 
measures were already available in the schools. Reading was selected for 
equating groups because; (1) performance on the task required some read- 
ing competence, and (2) reading ability and intelligence measures have been 
demonstrated to be highly correlated. The publishers of the Metropolitan 
Reading Tests report correlations between the Metropolitan Reading subtests 
and the Pintner General Ability Tests (4th grade— Pi ntner-Durost , Scale ./ 
2/5th, 6th grade— intermediate verbal) of .83 to^.go. These correlations 
were acquired during the publisher's 1958 national standardization pro- 
gram. However, it should be pointed out that the learning task which was 
used was not one which would place high premium on intelligence as 
measured by such tests. Wechsler (1944, p. 4, 84) points out that intel- 
ligence is characterized by only normal memory ability, and that memory 
span for digits "correlates very poorly with all other tests of 
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intelligence." The learning task employed in this study was similar to 
digit memory span in that both involve rote acquisition. 

Subjects were matched across all treatments where comparisons were 
to be made. For instance, as was mentioned in the introductory chapter, 
the investigator was interested in whether or not there was a learning 
advantage for the ^ at close proximity to the experimenter giving feed- 
back. This meant that in order to test the proximity variable, subjects 
at greater distances had to be matched with those at lesser distances. 

Subjects A and D, B and C, 1 and 4, etc. of the eight member group were 
matched as closely as the available range of scores would permit. Then in 
order to compare direct reinforcement conditions against the vicarious con- 
dition and the isolated conditions, it was necessary to match _Ss across 
these comparisons also. 

Subjects 

The subjects for any particular group were selected from a single 
grade (4th, 5th, or 6th) within a single school. All treatments were 
equally represented within each school-grade. Thus the subjects in 
isolation conditions 3 and 4 ended up with 44 and 40 ^s respectively 
instead of the intended 48. 

Three elementary schools in Salt Lake City provided ^s for the experi- 
ment and one in Bountiful. The three in Salt Lake were Oquirrh, Forest, 
and Stewart and the one in Bountiful was the Boulton Elementary. The 
first of these is located in an obsolescent residential section of the 
downtown area. The second represents a middle class population, and the 
third is the teacher training laboratory school of the University of Utah. 

The fourth school, the Boulton Elementary of Bountiful, Utah, is in a suburban 
residential area. 
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E xperimenter ' s Instructions 



Instructions to those in a group of eight were as follows: 

"This is a part of school; however, it resembles a game 
more than anything else. We will be learning to use words from 
a foreign language. To increase the fun, I'm not going to tell 
you what language It is until a few days later. 

It's important that you pay careful attention, even though 
I may not be talking directly to you. Don't be concerned about 
your turn to answer the problem; Instead, try to learn the 
meaning of the foreign word. Some people won't be asked for 
many answers (4/4 vicarious S_s were only given one word at the 
beginning of the first and second days; these words did not 
count in the analysis). We'd like everyone to learn all the 
words he can, but, or course. It's pretty hard to learn them 
all. (At this point a card was placed on a stand on the experi" 
menter's desk. — see Figure 2 ). 

"Now, let's practice. I'll call your name, and read the 
foreign word to you. You will then read the two English words 
aloud, and tell me which one you think means the same as the 
foreign word. I'll call on only one person at a time, and only 
he should answer. 

"Okay, (S_'s name) your word Is "Saugling;" read the two 
English words and tell me your choice." 



When the subject had made his response to each item, the experi- 
menter said, "right," "good," or "correct" unless the response was wrong, 
in which case the experimenter said nothing. After the feedback had been 
given, the card showing the problem was kept In view for four seconds In 
order to permit subjects to fixate the correct choice. 

Isolates instructions were similar. Instead of the group Instruction 
relative to making oral responses, they were told how to advance the 
machine, how to respond subvocal ly, and how to bring the correct answer 
into view. The subject was told to hesitate after seeing the answer, so 
that he would be able to remember it. He was then told to proceed on his 
own, and to try to connect the foreign word with the appropriate English 
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word. Condition IV was the same as Condition HI except that the teacher 
sat by the pupil and said *‘right“ or "correct" on appropriate occasions. 

The Experimenter . The Data Record i nq 

The experimenter who presented the learning task to the group was a 
young woman who had several years of teaching experience. On occasions^ 
when she was presenting the material to be learned, the scientist in charge 
sat at the rear, or at the side of the room, in order to record the res- 
ponses of the subjects. A separate record was kept of each subject's 
performance on particular items presented to him. 

Another young woman directed the learning under Treatment 4 which 
involved isolation with aural feedback from the experimenter. This experi- 
menter kept her own record of response performance in as inconspicuous a 
manner as possible. Right responses were indicated on a list by placing 
a dot in front of correct items. 

The scientist in charge served as experimenter for the isolation 
Condition 3 subjects. After an initial explanation of how to proceed with 
the "teaching machine," no more contact was made with the subject unless 
the machine jammed. 

The Task To ^ Learned . 

The learning task as previously described consisted of a German 
stimulus word which in the experiment was to be correctly associated with 
one of two English noun responses. Figure 2.02, shows the format of the 
task as it was presented by the teacher-experimenter to each* subject . 

The acquisition trials were conducted for three days: Monday, 

Tuesday, and Wednesday. On each of these days a set of twenty words was 
presented with four trials on each. The total of eighty presentations 
per day required approximately forty minutes to complete. A criterion 



o 



2.09 







Fig. 2.02 Example of task presentation format. The top word is 
German meaning baby. Subject was to select either Baby or Gift as the 
equivalent of Saugllng. On repeated presentations choices were alter- 
nated as to position, i.e., Gift appeared on top, Baby on bottom. Four 
trials on each card were allowed. Cards were 11 x 14 inches. 
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measure was not taken until Friday. 

The task was presented on a series of cards similar to the one illustrated. 
The experimenter stood in front of tijie row of 8 pupils and held up a card. 

He then read the German word and the two English words. The pupil designated 
to respond to the particular card then guessed which answer was correct. 

The experimenter then either told him he was right or took up the next 
card and turned to the pupil who was to respond to it. On each day the 
experimenter went through the cards four times in two different orders. 

Criterion Measures 

A test of retention was given on Friday, This was four days after 
the first learning trials (Mo:. day) , and only two days after the last of 
the trials (Wednesday). 

The test was intended to measure recognition skill. It presented 
the stimulus word and offered four choices for the response as shown on 
pages 1 . 13 “ 1 . 15 . 
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AcquI si t Ion Learn! nq Curves 

Learning Condition 1 and 2 Involved learning In a group of eight 
subjects. Half of these subjects (four) responded orally to the problems 
as presented on cards by the experimenter. For these ^s measures of 
learning were obtainable while acquisition was in progress. However, 
since the remaining four S_s (Condition 2) did not overtly respond, no 
measure of acquisition could be had from them during the three days of 
learning trials. 

There were two isolated (learning alone) treatments each with four 
£s per experimental set. In one of the treatments the participants worked 
alone. These Ss received reinforcing visual feedback from "teaching 
machines" after responding subvocal ly. Since responses were covert, no 
acquisition measures could be obtained from them while learning was In 
progress. The other isolated treatment involved a single ^ at a time 
giving oral responses to an experimenter who watched the S_ operate a 
"teaching machine." For this group a measure of learning was recorded by 
the attendant experimenter. 

The experimental conditions provide acquisition learning curve data 
for Condition 1, direct oral responding in the group of eight, and for 
Condition 4, oral responding of isolated Ss In the presence of an experi- 
menter. Comparable tasks were used on each of three days. These tasks 
have been designated A, B, and C, Each task involved twenty problems 
(a German noun stimulus, and two English noun response choices); thus, 
there was a total of sixty items. Each one of these was presented four 
times with the four presentations scattered through the eighty trials of 
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the series. 

Acquisition learning curves are shown in Figures 2.03 and 2.04. The 
points plotted on the curves were obtained as follows: (1) the performances 

of all S,s were summed together separately for the first, for the second, 
for the third, and fourth presentations of specific items; (2) a mean was 
determined for each of the four presentations; (3) these means were then 
converted to a scale of 20, since 20 items constituted each day's task. 

To illustrate, there were 48 subjects involved in Condition 1. Each 
responded orally to five items per day. This resulted in 240 first 
trial subject-items. Had it happened that on the first trial there had 
been 120 correct responses, the mean performance would have been .5. Such 
a performance would be placed at ten on the vertical scale in Figure 2.03, 
since 20 items constitute one day's task. In reality, as seen in Table 
2.01, the first presentation performance was in fact only slightly higher, 
or 10.25. Since there were two alternatives for each item in a twenty- 
item task, the first trial performance level of these ^ ( approximately 
.5) is very near the mean chance expectancy. Six first trial measures 
(two groups — three days) are all consistent in this respect. 

This is fairly good evidence that mean association values are near 
zero, but slightly and consistently positive. That is, taking the task 
as a whole, negative association values are practically equal to positive 
values. The learning subject chooses almost, but not quite, as many wrong 

responses as he chooses right ones. 

No evidence is available from these curves to indicate the extent of 
differential association value when considering individual items. There 
may be great disparity, or the association values may be homogeneous. 
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Figure 2.03 Mean performances for direct reinforcement subjects (oral respondents in grou 
on two-choice acquisition trials and delayed test. 



TABLE 2.01 



MEANS AND STANDARD DEVIATIONS FOR DIRECT REINFORCEMENT Ss 
IN THE GROUP SITUATION ON ACQUISITION TRIALS CONSISTING 
OF FOUR PRESENTATIONS FOR EACH STIMULUS WORD AND 
WHERE EACH TASK INCLUDES 20 WORDS 
SHARED BY FOUR Ss 







Presentations 






1st 


2nd 


3rd 


4th 




X SD 


X SD 


X SD 


X SD 


MONDAY 
TASK A 


10.25 

2.22 


14.0 

2.48 


15.75 

2.1 


17.17 

2.33 


TUESDAY 
TASK B 


10.25 

1.50 


13.83 

2.31 


16.75 

2.5 


17.75 

1.5 


WEDNESDAY 
TASK C 


10.25 
1 .89 


14.58 

2.4 


15.92 

2.18 


17.17 
1 .82 



Note; Twenty items having two alternatives were Included in each 
task. The mean expectancy on the first presentation was chance, or 
10 correct. 
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TABLE 2.02 



MEANS AND STANDARD DEVIATIONS FOR CONDITION k (ISOLATION-TEACHING 
MACHINE AND AURAL FEEDBACK) ON AC(iUISITION TRIALS CONSISTING 
OF FOUR PRESENTATIONS FOR EACH STIMULUS WORD AND WHERE 
EACH TASK INCLUDES 20 WORDS PRESENTED TO EACH S 







Presentations 








1 St 


2nd 


3rd 


4th 




X SD 


X SD 


X SD 


X SD 


MONDAY 
TASK A 


11 .38 
2.86 


12.25 

2.59 


13.55 

2.95 


14.93 

3.56 


TUESDAY 
TASK B 


10.63 

2.69 


12.10 

3.33 


13.30 

3o4l 


15.10 

3.59 


WEDNESDAY 
TASK C 


10.35 

3.31 


12.63 

2.68 


14.9 

2.22 


16.65 

2.63 



Note: Twenty items having two alternatives were included in each 
task. The mean expectancy on the first presentation was chance, or 10 
correct . 



More information will be offered on this in the section on identifica- 
tion or copying behavior. Without sufficient evidence to show that 
individual items were equivalent in terms of relative association values 
it was necessary to use identical items on both sides of all comparisons. 
That is, all learning conditions involved the same task, item for item. 

Looking at the curves for the direct reinforcement group (Figure 2.03), 
the reader will note a slightly negative acceleration. There is a more 
rapid rise at the second presentation than is the case for isolated ^s 
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(Figure 2,04), The advantage remains with the direct reinforcement group 
for all presentations, over all three days. 

Acquisition for the isolation group was almost perfectly linear. 

In addition, the reader will note a nearly identical performance on 
Monday to that of Tuesday and Wednesday, In relation to the consistency 
of these learning trials the criterion performance measured on Friday is 
of notable interest. The graphic comparisons afford a ready grasp of the 
original le-arning and later forgetting pattern (Figure 2.04). When learning 
is measured by a delayed test, notice that primacy in learning holds a 
strong advantage over recency on this task. Monday's material is 
remembered fai better than is Tuesday's or Wednesday's (Sig. <.001, F = 
82.7 with 2 and 468 d, f , , see Table 2.03). 

Criterion Measures of Retent i on — Analysis of Variance of Treatments 
by Day of the Learning Trials 

In order to examine the major outcomes from the retention data 
collected with the recognition test on Friday, a four by three analysis of 
variance design was employed. Differences in the four learning conditions 
were compared on one dimension and days of the learning trials on the othe 

Only the first ten experimental sets were used. This resulted in 
40 _Ss per learning condition being used. Table 2.03 summarizes the 
analysis of variance. As already reported in connection with the learning 
curves, the effect of days was highly significant. The first day's task 
was remembered better, at a highly significant level, than the following 
two days. The lesser ability to remember the more recent material was 
not quite so dramatic for Condition 3» isolation, as for the others. 

An interaction effect (Sig, beyond ,01) resulted largely because of this. 
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TABLE 2.03a 



MEAN TEST PERFORMANCE ON 
FRIDAY FOR EACH DAY'S TASK 



1. 


Direct 2. 

Learning 


Vi carious 
Learni ng 


4. 

3. Machine 
Feedback 
Only 


Machine Feed- 
back plus 
Teacher Feed- 

bc ck 


Monday (20 items) 


12.80 


11.85 


11.73 


10.03 


Tuesday (20 items) 


8.40 


8.03 


9.18 


9.43 


Wednesday (20 items) 


7.40 


6.43 


8.33 


6.90 


Total for entire task 28.59 


26.31 


29.22 


26.36 


TABLE 2.03b 



ANALYSIS OF VARIANCE FOR LEARNING C0NDIT80NS 
AND DAYS OF THE LEARNING TRIALS 



Source of Variation 


Sum of Squares df Mean Square 


F 


Requi red 

.05 .01 


Between days 


1554 


2 


777 


82.7 


2.99 


4.60 


Between learning 


92 


3 


31 


3.3 


2.68 


3.95 


condi t i ons 














1 nteract i on 


198 


6 


33 


3.51 


2.17 


2.96 


Within 


4419 


468 


9.4 








Total 


6263 


479 
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Isolation Condition 3» where the ^ learns the task as it is presented 
by a teaching machine, had the highest total performance. While this 
learning condition manifests the highest over-all performance, and the 
highest terminal performance (Task C, learned Wednesday), its learning 
was next to the poorest on Task A (learned Monday). Again, this evidences 
the significant interaction effect of days of learning and treatment 
condi t i ons. 

The learning conditions also had significantly differential effects. 

In order to determine where such differences lay, a Scheffe Test of Multiple 
Compar i sons was carried out. The results are presented in Table 2,04, 

TABLE 2.0k 



SCHEFFE TEST OF MULTIPLE COMPARISONS 
FOP LEARNING CONDITIONS TABLE 2.03 
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Condition 1, consisting of four ^s in a group of eight who responded 
orally in turn, had a total performance not significantly different from 
Condition 3 (isolation), the highest performing group. Had the learning 
trials been extended another day or two. this might not have been the casej 
for Condition 1 had a higher relative rate of performance decay over days 
than did Condition 3» 

An inspection of the Scheffe Test shows that Condition 3 (isolation) ^ 
exceeds Condition 2 (4/4 vicarious) beyond the .01 level of significance. 
The mean for Condition 3 (isolation) was also reliably greater than that 
for Condition 4 (isolation with experimenter feedback) beyond ps.Ol. 
Condition 1 subjects were also reliably superior to Conditions 2 (4/4 
vicarious), and 4 (isolated — with experimenter feedback) but at a level 
of significance less than .01. The only comparisons not significant were 
2 and 4, and 1 and 3- 

At this point, it is necessary to remind the reader that Condition 1 
was a mixed direct and vicarious treatment. That is, two learning 
conditions were in effect for these ^s. One-fourth of all items were 
experienced by each of the four ^s directly. Each responded orally to 
particular items, in turn, and was reinforced with feedback by the experi- 
menter for a correct response. For the remaining three-fourths of the 
items these subjects had to learn vicariously, that is, by listening to 
the exchange between the active _S and the experimenter. Tables 2.05 to 
2.07 show the two performances in Condition 1 broken down as percentages 
according to the one-fourth — three-fourths learning modes. Direct 
responding in the group (1/4) was superior to all other modes of learning 
studied. In each measure it was as high as any other condition, or higher. 
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TABLE 2.05 

PERCENT OF ITEMS CORRECT ON TASK "A«' PRESENTED ON 
MONDAY, TESTED AS RECOGNITION ON FRIDAY* * 



condition: i (social) (N = 48) 

PROXIMITY TO EXPERIMENTER 

12 3 4 TOTAL 

Direct Aural Feedback (1/4 of total items) 

64 73 69 57 66 

Vicarious Aural Feedback (3/4 of total items) 

68 58 62 54 6l 

CONDITION 2 (social) (N = 48) 

PROXIMITY TO EXPERIMENTER 
ABC D 

Vicarious Aural Feedback (4/4 of total items) 

55 56 56 67 59 

CONDITION 3 (isolated) (N = 44) 

Feedback ; 1. visual by machine 59 



CONDunON 4 (isolated) (N = 40) 



Feedback: 1. visual by machine 

2. aural from experimenter 50 

*Based on a four—choice test making chance expectancy approxi- 
mately 25 percent. 






TABLE 2.06 



PERCENT OF iTEMS CORRECT ON TASK "B" PRESENTED ON 
TUESDAY, TESTED AS RECOGNITION ON FRIDAY 



CONDITION 1 (social) (N ^ 48) 



PROXIMITY TO EXPERIMENTER 
12 3 4 




TOTAL 


Direct Aural Feedback (1/4 of total items) 

44 50 53 42 

Vicarious Aural Feedback (3/4 of total items) 
47 41 36 43 


47 

42 


CONDITION 2 (social) (N = 48) 


PROXIMITY TO EXPERIMENTER 






ABC D 






Vicarious Aural Feedback (4/4 of total 


1 terns) 




33 40 41 45 




40 


CONDITION 3 (isolated) (N = 44) 


Feedback: 1. visual by machine 




46 


CONDITION 4 (Isolated) (N = 40) 



Feedback: 1. visual by machine 

2. aural from experimenter 47 
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TABLE 2.07 

PERCENT OF STEMS CORRECT ON TASK "C" PRESENTED ON 
WEDNESDAY, TESTED AS RECOGNSTSON ON FRIDAY 



CONDITION 1 (social) (N = 48) 



PROXIMITY TO EXPERIMENTER 

12 3 4 TOTAL 

Direct Aural Feedback (1/4 of total items) 

49 53 47 50 50 

Vicarious Aural Feedback (3/4 of total items) 

38 39 29 34 35 

CONDITION 2 (social) (N = 48) 



PROXIMITY TO EXPERIMENTER 
ABC D 

Vicarious Aural Feedback (4/4 of total items) 

37 33 30 35 34 

CONDITION 3 (isolated) (N =44) 



Feedback : 1. visual by machine 42 



CONDITION 4 (isolated) (N = 40) 



Feedback; 11. visual by machine 

2. aural from experimenter 35 
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The subjects who responded directly to 1/4 of the items were compared 
with respect to their knowledge of these items with the subjects who had 
observed these responses. This comparison was made by means of a t test 
for the scores on these items. The results are in harmony with the 
analysis of variance, showing a significantly better performance for 
direct participation than for vicarious participation (t = 2.30 with 47 
d.f., one-tailed test, p^.02) . This comparison was made prior to the 
analysis of variance and serves only to corroborate the finding already 
reported as significant there. 

The Data from the Subjects Who Learned Vicariously 

What has been called vicarious learning has been shown in the present 
study to be not as effective as learning by more direct means. The sug- 
gestion has commonly been made that the efficiency of learning varies 
according to the extent of the subject's involvement. An occasional 
response given orally in a group, as was the case for Condition 1, may 
have the capability of maintaining sufficient arousal in the respondent 
so that his learning efficiency for other items to which he does not 
respond is enhanced. In contrast, if he never makes an overt response, we 
might expect his involvement in the relevant learning activity to drop off. 

A comparison was made between the learning on the items to which the 
subjects responded directly and the learning for these same subjects on 
the items to which they did not respond directly. 

A sign test (Dixon and Massey, 1951) of the difference between the 
sets of data offers evidence of greater performance decline over the three 
days for items vicariously experienced than for those overtly responded to. 
That is, what is commonly considered to be the proactive inhibition effect 
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was greater for vicarious learning than for learning by direct experience. 
Table 2.08 s'lows how the 48 subjects performed on items directly experi- 
enced and on items vicariously experienced. The data originated from the 
retention test given on Fridays of each week of the experimentation. A 
plus sign indicates that in that experimental group on the day specified, 
direct reinforcement subjects learned proportionately more items by direct 
experience than they did by vicarious experience. A minus sign occurs 
where S_s learned more vicarious items than direct ones. A majority of 
measures favored the directly experienced items (p .01). It should be 
borne in mind that this is not a weighted test. The weightings which: 
have no power in the significance test may be seen as summed numerical 
differences favoring positive (direct), or negative (vicarious) treat- 
ments. The mean difference favors the direct Condition on Monday's task. 
The advantage favoring direct responding increases on Tuesday, and again 
on Wednesday, suggesting the hypothesis that attention (or whatever factor 
is operating) falls off faster for vicarious ^s than for direct S^s . 

A simple analysis of variance was carried out to test this hypothesis. 
Difference values (direct minus vicarious) were used as the basis for 
analysi s . 

As can be seen in Table 2.09, the trend failed to show significance 
at the .05 level . 

The trend suggesting more rapid decay of learning for vicarious ^s 
than for direct may be of sufficient interest to warrant an extension of 
this design to include an additional day or two of learning trials. This 
would permit a re-examination of the differential performance decay with 
the possibility of a significant trend showing up. 



TABLE 2.08 



SIGN TEST OF DIRECT 1/4 ITEMS VERSUS VICARIOUS 3/4 ITEMS* 
NUMERICAL VALUES SHOW THE DIFFERENTIAL ADVANTAGE (X-Y) 
HELD BY THE CONDITION OF THAT SIGN 



Task 




A - Monday 


B - Tuesday 


C - Wednesday 




Experimenta 1 


4 


+1 


1- 


+4 




Set 


5 


+ 1 


+4 


+ 1 






6 


+3 


+3 


+7 






7 


+2 


+4 


+ 1 






8 


+4 


+8 


+2 






9 


3- 


1- 


0 






10 


+2 


1- 


+4 






11 


2- 


3- 


0 






12 


2- 


0 


+6 






13 


1- 


+1 


0 






14 


+2 


1- 


+8 






15 


+4 


+ 1 


+4 




+ n 




8 


6 


9 




- n 




4 


5 


0 





+ ^d 
-Xd 


19 


21 


33 




8 


7 


0 


+ Xd 


2.38 


3.5 


' 4.12 


- Xd 


2.0 


1 .4 


0 



*A plus indicates an advantage for direct, a minus for vicarious. 
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TABLE 2.09 

TREND ANALYSIS: ANALYSIS OF VARIANCE OF DIFFERENCE SCORES 

(lA DIRECT MINUS 3A VICARIOUS) BETWEEN DAYS 



Source 


Sums of 
Squares 


df 


Variance 

Estimate 


F 


F required 
at .05 


Between days 


6.70 


2 


3.35 


2.11 


2.99 


Within days 


224.66 


141 


1 .59 






Total 


231 .36 


143 









100 Percent VI carious Versus Partial VI carious . 

An attempt was made to demonstrate a difference In learning for 
subcondition lb (3A vicarious) when compared with Condition 2 (4A 
vicarious) on the Items held In common by these treatments. A significant 
difference favoring lb would possibly Indicate a general arousing capa- 
bility for the condition where S^s give Intermittent responses. This 
arousing state might then be expected to manifest itself by higher perfor- 
mance on the 3A vicarious items. The contrast would be against the 
relatively deprived activity state of subjects in the k/k vicarious 
condition. Again, the learning for both conditions was by the vicarious 
mode, and the measurement made only on those I terns held in common by both 
sides of the comparison. 

The percentage Tables 2.05"2.07 indicate a higher performance level 
for 3 A than for 4A; I .e.. Task A percentage right for 3A vicarious 
I terns was 6l% and 4A items was 58%. Five of the six measures in the 
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tables favor ^/k, A t-test of correlated means, having 47 degrees of 
freedom was not significant, however. It must be assumed, therefore, 
until a more sensitive measure can be taken, that the difference is not 
reproducible. 

Proximity to the Experimenter . and to the Responding Subject . 

One of the questions asked in this study was whether or not there 
might be a difference in learning for near and distant seating positions 
in relation to the "teacher"-experimenter who was positioned near the 
center of the row. The hypothesis was that the closer proximity would 
result in greater learning. An inspection of the data for Condition 1 
(1/4 direct and 3/4 vicarious) indicated no substantial proximity position 
effect. Such was not the case with the 4/4 vicarious data. The recognition 
data appeared to favor seating position D over C and B, and they inturn 
over seating position A, The individuals at greater distances from the 
center of the row had higher performances, with the individual at the 
end showing up best. 

A four by three analysis of variance compared positions and days of 
the learning trials. The results of this analysis are shown in Table 
2 . 10 . 

indications are either that: 

1. The apparent proximity effect for Task A and B is simply a 
chance variation of the data, or, 

2. The effect is real but temporary; otherwise. Task C variance is 
simply a chance fluctuation from the "true" population value. 

Table 2.10 shows that the proximity effect across three learning tasks 
combined was not significant, though it does not fall far short of the 
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5 percent level. The interaction term was not significant. 



TABLE 2.10 

ANALYSIS OF VARIANCE FOR PROXIMITY POSITIONS* 
AND DAYS OF THE LEARNING TRIALS 















Source of 
Variation 


Sums of 
Squares 


df 


Mean 

Square 


F 


Requi red 
.05 .01 



Between days of the 
learning trials 


638 


2 


319 


43.7 


3o09 


4.82 


Between vicarious 
posi tions 


55 


3 


18.33 


2.5 


2 .70 


3.98 


Interaction 


45 


6 


7.5 


1 .03 


2.19 


2.99 


Wi thi n 


961 


132 


7.3 








Total 


1763 


143 











*Positions In Condition 2, 4/4 vicarious. 



Vicarious ^s proximity to the source of responding was also examined. 
There is the circumstance where a Subject "D” is seated at the end of a 
row of eight subjects. In this event the question may be asked: Does he 

(Subject D) learn items better that are responded to by a Subject (3) 
who is seated immediately next to him, or does he more easily learn the 
responses of a distant Subject (4) who is at the opposite end of the row? 

A one-tailed hypothesis held that items presented to S_ (3) would be learned 
better by vicarious ^ (O) than those items presented to ^ (4). This was 
expected because of a supposed greater arousing potential for a near by 
stimulus-response exchange. While differences were in the expected 



2.30 



n 



direction, they failed to reach an acceptable level of significance. A 
t-test of the differences between means for correlated data was 1.24 with 
35 degrees of freedom. At the .05 level, a t of 1.697 is required. 

Pi scussi on 

Some significant differences were found among conditions of learning 
considered here. These may perhaps be very important differences. When 
performances for the three acquisition days were combined on the criterion 
test, the 4/4 vicarious scores totaled to 82 percent of the 1/4 direct 
scores. The difference between these two data groups was the greatest 
of any of the comparisons made. In appraising these circumstances one 
needs to keep In mind at least two relevant points: (1) a trend, which 
in this study fell short of significance at the .05 level, suggested that 
the difference separating direct and vicarious experience increased as time 
over the learning trials increased, and (2) the prevailing experimental 
conditions were distinctly £d hoc , suggesting that the attention level of 
vicarious _Ss was probably higher than might be expected under circumstances 
where adaptation to the environs had occurred. 

Statistically significant differences between the four treatments are 
indicated below by symbols (x) and (y) . The same symbol appearing before 
two learning conditions indicates that the mean performances in the two 
conditions do not differ at the .05 level. Different symbols indicate 
significant differences of at least .05. 

(x) Condition 1. Two conditions operating: 

(a) 1/4 of the items were directly responded to. 

(b) 3/4 of the items vicariously experienced. 
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(x) Condition 3* so 1 ate d f rom other ^s. Working clone 

at a "teaching machine" from which visual feedback was 
afforded. 

(y) Condition 2 (4/4 vicarious). All items are experienced 
by observing the ^s in Treatment 1 respond. 

(y) Condition 4. Isolated and working under the same condi- 
tions as Treatment 3 except that an experimenter was 
present to give aural feedback as an additional condi- 
tion. 

Condition 3 (working alone) had the highest over-all performance. 

This was so in spite of a comparatively poor first day's performance. 

Both Conditions 3 and 4 (isolated conditions) had poor beginnings when 
compared with Conditions 1 and 2 (group conditions). By the third day of 
learning trials, however, the Condition 3 isolates were the best performers 
and sufficiently so to give this treatment the highest over-all performance 
(though not significantly different from Condition 1). These outcomes 
give rise to the following questions: 

1. In the isolated conditions the ^s had control over 

presentation of their own items by use of a "teaching 
machine." The to-be- learned material was adapted to 
"teaching machines" with the format of each item being 
comparable to the card presentation shown in Figure 2.02 
cards used to present the task to groups). The question 
arises as to whether or not poor initial performance 
could be accounted for in terms of time needed for adapta- 
tion to the unfamiliar mechanical device. Attention is 
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expended in part on various of the unfamiliar elements 
of the new environment. When there are distracting 
conditions, one might speculate that relevant attention 
is diminished to the extent that distractors use up the 
channel capacity of the perceptual system. However, 

Azrin (1958) has shown that noise inhibits discrimination, 
if the discrimination had been originally learned under quiet 
conditions. Conversely, when the discrimination behavior 
had been learned in the presence of noise, quiet was then 
disrupting. Habituation to the environment apparently 
can permit more effective selection of stimuli to be 
admitted to the perceptual system. 

2. The reader will recall that Condition 1 actually involved 
two learning conditions (1/4 direct responding, and 3/4 
vicarious responding). Even though Condition 3 (isolation) 
had the highest total performance, if the learning of 
specific items is examined, then the items directly 
responded to and directly reinforced in the group situa- 
tion were more effective. Even on the third day of learning 
trials when isolated Condition 3 ^s had their best day, 
learning was still inferior to the direct oral response 
and reinforcement mode in the group situation. When 
1/4 direct and 3/4 vicarious items were combined in 
Condition 1, the level of performance was slightly less, 
and not significantly different from, isolated Condition 3» 
Could one perhaps account for the generally superior learning 
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under the oral response mode in the following way: 

a. When a ^'s name was called, in his turn, indicating that 
he should make the oral response, such a response required 
an obligatory orientation to the task, at least, atten- 
tion had to center here sufficiently long for the 
subject to vocalize a response. For the other ^s, 
vicariously experienced items (observation of another's 
responding) entailed no such necessity for an arousal 
posture appropriate to the learning task. The sub- 
ject was "free" to engage in other activity or to 

dream, if so inclined. The fact that the greatest 
learning, of all of the conditions of learning studied, 
occurred as a consequence of oral responding, and the 
poorest learning occurred in the passive situation 
of sitting in the presence of those making oral 
responses is suggestive that the study of attention 
processes lead to an understanding of the nature 
of certain learning mechanisms. 

b. In addition, it is quite conceivable also that intermittent 
direct responding is capable of elevating arousal level. 
This higher general activity level might then increase 

the subjects sensitivity to task stimuli. 
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CHAPTER III 



LEARNING AS A CONSEQUENCE OF THE LEARNER'S TASK INVOLVEMENT 
UNDER DIFFERENT CONDITIONS OF FEEDBACK 



I. 
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In the study reported in the previous chapter subjects 
learning in groups of eight were exposed to two main learning conditions. 
Four of the subjects learned by a recitation procedure, interacting with 
the experimenter who functioned as a teacher. These subjects were referred 
to as the direct subjects. The remaining four were able to learn the task 
only by observing the performance of the direct subjects and their inter- 
action with the experimenter. The latter four subjects were referred to 
as observer subjects and, according to current custom, their learning 
was designated as vicarious learning. The task involved in the study was 
the acquisition of German vocabulary. The exper imenter- teacher presented 
the German words, one at a time, on large cards accompanied by two English 
words one of which was the equivalent of the German word. The words 
were printed in large letters clearly visible to the entire group. After 
a direct subject, designated by the experimenter, selected one of the 
English words as the equivalent of the German word, he was either told 
by the experimenter "That is right," when it was correct, or was told 
nothing when the response was incorrect. Although the observer subjects 
were provided with an equal amount of information when the direct subject 
responded correctly as when he responded incorrectly, the data suggested 
that they learned better when they observed a correct response than when 
they observed an incorrect one. This led to the hypothesis that the 
specific nature of feedback provided by the experimenter in such a situation 
would be an important factor in determining the learning of the observer 
subjects. This is the central problem on which the present study focuses. 

While little research has been undertaken on learning in situations 
which simulate those of the classroom, a considerable amount of information 
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has been obtained on the relationship of feedback to learning under 
conditions where only one learner is involved at a time. 

Most of the studies have been concerned with the effect of the 
experimenter saying right or wrong or nothing or combinations of these. 

The results of such research are difficult to fit together into a con- 
sistent pattern because of the varied nature of the subjects used. 

Buss, who has conducted a series of researches with various asso- 
ciates (Buss, Wiener, and Buss, 195^; Buss, Braden, Orgel and Buss, 

1956 ; Buss and Buss, 1956| and Ferguson and Buss ^1959), have generally 
found that the combination of no comment (N) for correct responses and 
“wrong" (W) for incorrect, or "right" (R) for correct responses and 
"wrong" (W) for incorrect, produces more rapid learning than the combination 
of R-N. Buchwald (1959a, 1959b) has taken the position that N acquires positive 
reinforcing properties when given in the combination N-W, and negative 
reinforcing properties in the combination N-R. Meyer and Seidman (I960) 
found that no comment appeared to have reinforcing properties with an 
8-9 year old group but not with a ^-5 year old group. The younger, 
pre-kindergarten group seemed unable to utilize the information provided 
by silence on the part of the experimenter when the response was correct. 
Clearly, providing the best form of feedback is a relatively complex 
problem even in situations involving only one learner. The present 
study explores the relative effectiveness of different forms of feed- 
back in a simulated classroom situation in which there are pupils who 
interact with the teacher and pupils who learn by observing the inter- 
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Method 

Experimental Des ign 

In the present study, as in the previous one, a simulated classroom 
situation formed the context of the study. Eight "pup i 1”- subjects sat 
in a row facing the experimenter who functioned in the teacher role. 

The experimenter interacted with the odd numbered subjects, but not with 
the even numbered ones, except on a single demonstration trial when he 
Interacted with all Ss. The experimenter presented a German word 
together with two English words which were printed in large letters on 
an 11" X 14" card. When presenting a card, the experimenter turned to 

a designated subject and said, "Your word is " (German word). 

The designated subject then read both English words and selected one as 
the correct translation. After the subject had made his selection, he 
received knowledge of results according to the condition to which his 
group had been assigned. The four feedback conditions were as follows; 

Condition 1 

a. S correct - "That's right." 

b. S incorrect - "No, that's wrong." 

Condition 2 

a. S correct - E said nothing when S made correct response. 

b. S incorrect - "No, that's wrong." 

Condition 3 

a. S correct - "That's right." 

b. S incorrect - "No, that's wrong, (German word ) means 
(Engl i sh word) . ' ' 
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Condition 4 

a. S correct - E said nothing when S made correct response, 

b. S incorrect - "No, that's wrong, (German word ) means 
(Engl i sh word ,) " 

Nine groups of eight subjects each were assigned to each one of 
the conditions and in each group four of the subjects, the direct 
learners, interacted with the experimenter, while four, the observer 
subjects, learned the task by observing the interaction. The work was 
undertaken in an empty room of a school. 

Sub iects 

The subjects (N = 288) were fourth, fifth arsd sixth graders drawn 
from three public elementary schools in Salt Lake City, Utah. Two of 
the schools were located in an older residential section of the city, 
and the third school was situated in a newly developed suburb. 

Task 

The task in the present study consisted of learning to match 
sixty German words with their English equivalents. This learning occurred 
over three consecutive days beginning Monday. The design of the task has 
been fully described in the previous study already referred to. 

Procedure 

Three learning sessions for each group took place on Monday, Tuesday 
and Wednesday mornings. Sessions lasted between twenty-five and thirty 
minutes. Usually three experimental groups were seen per week. On 
Friday a recognition test was given to that week's subjects to measure 
the amount of word- learn ing which had taken place. 
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On the basis of either reading scores, (obtained from the Metro- 
politan Achievement Test or the Science Research Associates Battery), 
or an intelligence quotient (from the Pintner General Abilities Test) 
eight, sixteen, or twenty- four subjects were selected from each of the 
fourth, fifth and sixth grade classes available in a school. Reading 
scores or I.Q. 's were ranked for each class, and subjects chosen count- 
ing up and down from the mean score. Thus experimental groups chosen 
from a class were kept as homogeneous as possible with respect to reading 
grade placement or I.Q. of the group's members. If a chosen subject 
was absent on Monday, another class member was substituted, keeping 
the reading score, or I.Q,. as equivalent as possible to those of the 
rest of the group. If a subject was absent on a Tuesday or Wednesday, 
a substitute from his class was chosen for the learning days, but cniy 
the data of the original subject was used in the analysis. Children 
with any previous knowledge of German were eliminated from the sample. 

In one or two cases, it was discovered after the experiment had been 
run that a child with a German background had been used as a subject. 

In those cases, the subjects' data was not used in the analysis. 

After assigning subjects to seats in the row of chairs, which 
was six feet in front of E, the experimenter gave the group the follow- 
ing instructions: 

"Although this is part of school, it is more like a game than 
anything else. What we do here won't have any affect on the 
rest of your grades. Only Miss H (the second E, who acted as 
recorder and observer) and I will know what you do here. You 
vji 1 1 be learning some words from a foreign language, but to 
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make it more fun, I'm not going to tell you what language you're 
learning until we're all through working in your school. 

It is important that you pay very careful attention, even though 
I may not be talking directly to you. Don't worry about your 
turn to answer. Instead, try to learn the meaning of the foreign 
word. Some people won't be asked for many answers, but we'd 
like everyone to learn all the words he can. You'll be learning 
words Monday, Tuesday and Wednesday. On Thursday we won't come 
•at all. But on Friday we'll come again and see how well you 
remember the words you learned. So try to remember as many words 
as you can. 

(At this point E put a task card on a stand which was placed 
on a table in front of her.) 

"Now let's practice. I'll call somebody's name, and then I'll 
read the foreign word aloud. The person whose name 1 call, will 
then read aloud the two English words which are underneath the 
foreign word. Then that person will guess which one of those 
two English words means the same thing as the foreign word. The 
person 1 call on will know if he's right or wrong because 1 
will tell him so, (in the case of Conditions 1 and 3). (When 

using Conditions 2 and 4, E said, "If (S) makes a wrong 

guess. I'll tell him so, but if he's correct, 1 won't say anything 
at all.") I'll call on only one person at a time, and only he 
should answer. Let’s have a few practice tries. O.K. (one of 
observer S's names) , your word is 'Stirn.' Read the two English 
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words aloud and then tell me which one you think means the same 
thing as the foreign word. At first it will be like a guessing 
game, but after a while you'll be more sure of your answer." 

Miss H. Is the score keeper for this game. (S read the words, 

E gave appropriate feedback, and E gave a practice word to each 
of the three other observer subjects.) 

"All right, are there any questions? Now let's start with 
the first word that's really part of the game." 

The other features of the procedure were the same as those involved 
in the study reported in the previous chapter. Twenty new words were 
learned each day on Monday, Tuesday and Wednesday, and the retention 
test was administered on Friday.* The task was presented on the same 
cards as were previously used. In administering the test, the experi- 
menter read each German word aloud and then paused about ten seconds 
for the subjects to respond. 

Resu 1 ts 

Effects of Feedback Mode . Sub iect I nvolvement . and Days of Learning 

(1 

An analysis of variance based on the recognition test data from 
288 Ss is provided in Table 3.01. This test data, obtained on Fridays, 
provided separate scores for the learning which had occurred the pre- 
vious Monday, Tuesday and Wednesday. The data also provided separate 
measures for those Ss involved directly In learning German vocabulary 
and for those who learned by observation only. In addition, equal 
numbers of subjects learned under each of four feedback (knowledge 



3.09 



II 



Table 3.01 



ANALYSIS OF VARIANCE FOR FEEDBACK CONDITIONS 
DAYS OF LEARNING, AND EXTENT OF SUBJECT INVOLVEMENT * 



Source of Variation 


df 


Mean Square 


F 


Requ i red 

.05 .001 


Between Feedback 












conditions (L) 


3 


65,3 


5,8 


2.6 


5.4 


Between Direct (C) 












learners and 












observer learners 


1 


228.0 


20.3 


3.8 


10.8 


Between day? (R) 


2 


1073.5 


99.1 


3.0 


6.9 


R X C 


2 


38.0 


3.4 


3.0 


6.9 


R X L 


6 


10.0 








C X L 


3 


3.0 








R X C X L 


6 


6.3 








Within 


840 


11.3 








Total 


863 











*For basic data refer to Table 3.03 
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of results) conditions. It is apparent from Table 3*01 that all main 
effects were highly significant. 

A Scheffe Test of Multiple Comparisons was used to assess differ- 
ences between individual feedback conditions. These comparisons are 
given in Table 3.02. A relationship seems to exist between information" 
content of the feedback condition and extent of learning. For example, 
under Conditions three and four, which were the most i nf ormat i on- 1 aden 
of the four feedback conditions, highest performance on the criterion 
test occurred. Learning under feedback Condition four was slightly, 
but not significantly, better than under Condition three. 

Under the latter two conditions not only was a wrong response indi- 
cated, but the stimulus word and appropriate response were repeated 
by E in close temporal contiguity. This circumstance provided redundant 
information, since, in a binary choice situation saying, "No, that's 
wrong" gives complete information concerning the correct response choice, 
It should be noted that the correct response was the last phrase heard 
by the subjects under feedback Conditions three and four. 

Condition three provided, in addition tQ the feedback given in 
Condition four, the utterance, 'That's right," given by E when S res- 
ponded correctly. This additional phrase, however, did not seem to 
add to the effectiveness of the feedback. 



Condition one differed from condition three only in that it failed 



to add the phrase "_ 



means 



Accord- 



ing to the Scheffe test learning under Condition three was not signi- 
ficantly more effective than under Condition one. 
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LEARNING PERFORMANCE ACCORDING 
TO FEEDBACK CONDITIONS * 



Source of variation 






F^ 


P 


A. 

4 - No, that's wrong, 


^means 




46.31 


.001 


versus 

2 - No, that's wrong 
B. 

3 - That's right (and) 


No, that 's wrong __ 


means 


29.85 


.001 


versus 

2 - No, that 's wrong 
C. 

1 - That's right (and) 


That 's wrong 




15.99 


.01 


versus 

2 - No, that's wrong 

D. 

1 - That's right (and) 


That's wrong 




7.88 


.05 


versus 

4 - No, that's wrong, ^ 
All other comparisons not 


means 

: significant 











*For basic data refer to Table 3»03 
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It Is obvious that feedback Condition 2,(»'That's wrong, ”)was 
significantly Inferior to all other conditions studied. No redundant 
Information was provided by the experimenter and perhaps of consequence 
Is the fact that "No, that's wrong," emphasizes a wrong response; thus 

Increasing Its availability for some Ss. 

Primacy In learning was superior to recency under all conditions 
studied (P<.001). That Is, the words learned on Monday were, under 
all feedback conditions, better retained on the Friday test than those 
learned on Tuesday or Wednesday. That which was learned on Tuesday was 
also better retained than the learning accomplished Wednesday. 

Subjects directly Involved In learning the task (odd-numbered Ss) 
were thus Involved for only one-fourth of the task Items while they 
learned as observers during the presentation of three-fourths of the 
Items. This Is to say that each of the directly Involved subjects 
received only five of the twenty Items presented on Monday, on Tuesday 
and on Wednesday directly. The remaining fifteen Items could be learned 
each day by observing the Interaction of other Ss with the experimenter. 
These two different conditions for learning were combined (1/4 + 3/4) 

In the analysis of variance In Table 3*01» 

Since the learning for direct Ss was measured separately on those 
Items learned directly and those learned by observation, these perform- 
ances together with those for Ss whose total learning experience was by 
observation have been represented as percentages of the task learned 
and are provided In Table 3.03. In the previous study a knowledge-of- 
results condition "That's right" was employed In an experimental paradigm 
similar to the one used here. The percentage values from the earlier 
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Table 3.03 

PERCENTAGE OF THE TASK RETAINED AT THE TIME OF THE FRIDAY TEST 







Direct 

1/4 items 
direct 


subjects 

3/4 items by 
observation 


Observer subjects 

all i terns by 
observat i on 


Monday's task 
(20 i terns) 


Feedback 


1 


68 


71 


66 


Condit ion 


2 


62 


63 


63 




3 


69 


65 


67 




4 


76 


69 


71 




^5 


72 


62 


60 


Tuesday's task 
(20 i terns) 


Feedback 


1 


54 


51 


48 


Cond 1 1 i on 


2 


61 


49 


46 



3 

4 



66 

61 



59 

5 ^ 



50 



51 
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Table 3*03 (Continued) 



Direct 


subjects 


Observer subjects 


1/4 items 


3/4 items by 


all i terns by 


di rect 


observation 


observat i on 



Wednesday's task 
(20 items) 



Feedback 


1 


65 


51 


46 


Cond i t i on 


2 


58 


45 


43 




3 


63 


52 


45 




4 


68 


52 


46 




»5 


52 


35 


33 


Total task 
(60 items) 










Feedback 


1 


62 


58 


53 


Cond i t i on 


2 


60 


52 


51 




3 


66 


59 


54 




4 


68 


58 


56 



®N=72 Under each feedback condition 

*^Feedback condition - "That's right" from earlier study 
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study are included in Table 3 . 03 as feedback Condition 5. While Ss 
in the two investigations were from different elementary schools, with 
a different person, but of the same sex serving as the experimenter, 
other conditions such as grade in school, sex of the subject, and read- 
ing test scores appeared to be comparable. The "That’s right" feedback 
mode resulted in less learning than any of those from the present study. 

Retention of the task was measured again after the passing of 
about five months and is given in Table 3.04 as percent retained. This 
measure was acquired at only one of the schools where the sample was 
originally drawn. The retest was administered to I 76 Ss, but these were 
not equally distributed across feedback conditions as may be observed 
in Table 3.0^. It is worth noting that the proactive inhibition effect 
(advantage for primacy over recency) was still apparent, that is, 
Monday’s task was better retained than Tuesday’s, etc. In general, it 
appears that the four reinforcing conditions retained approximately the 
same relative order of effectiveness, with Conditions 3 and 4 being high 
and Conditions 2 showing poorest retention. 

Interactions . . As may be noted in Table 3.01, an interaction 
between extent of subject involvement in learning and days of learning 
was significant at the .05 level. This interaction does not take into 
account the fact that the direct subjects learned under two conditions 
and, when these two conditions are separated, an additional component 
is added to the interaction. This is illustrated in Figure 3.01. 

Variance due to other interactions was in each case not significant. 

These included feedback-mode conditions by days, feedback-mode by extent 
of involvement, and the triple interaction of all main effects. 



Percent of Task Learned 
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Day Learning Occurred 



Fig. 3.01 Interaction between extent of subject involvement 
and days when learning occurred. 
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Discussion 

Each one of the four learning conditions described provided all of 
the information necessary for the acquisition of correct responses, but 
the feedback differed in the amount of redundancy provided. The situa- 
tions thus differed from those studied by Buss, Buchwald and others for 
they provided different forms of feedback giving equal information. 

The amount of redundancy is related to the degree to which the task is 
learned - with greater redundancy favoring learning. Of interest is 
the fact that those forms of feedback, in which the last item of informa- 
tion transmitted was the correct response , were significantly more, effec- 
tive than those in which other information was the last transmitted. 

This finding is in contrast with studies of R, N and W as reinforcers 
in which the combination R-N was generally proven to be less effective 
than the other two combinations. 

Another point of interest raised by the study is that the subjects 
who interacted with the experimenter performed better not only on the 
items on which they interacted but also on the items which they learned 
by observation. The data suggest the interpretation that the direct 
interaction procedure raises the level of arousal of the direct subjects 
which, in turn, influences acquisition on the items which they learn 
by observation. 
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CHAPTER IV 



RELATIONSHIP OF LEARNING TO ATTENTION 



IN SIMULATED CLASSROOM SITUATIONS 
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While teachers and psychologists agree that the degree of acquisition 
of a skill is highly dependent upon the learner maintaining an orientation 
towards the task, there is considerably less agreement on what constitutes 
evidence that the learner is task oriented. In a typical verbal transaction 
in the classroom between teacher and pupils, such as occurs during recita- 
tion periods or during class discussions, teachers typically demand that 
each of the pupils direct his gaze towards the teacher or the discussant de- 
spite the fact that the information is being transmitted through the ear. 
Whether this required behavior facilitates the acquisition of knowledge is 
an open question. One can understand that when the information to be ac- 
quired is to be transmitted through the eyes that the direction of the gaze 
might be an important matter. However, when the information is transmitted 
through the auditory channel such a visual orientation would appear to be 
quite unnecessary. The data derived from the previous experiment permits 
the investigation of problems such as these since under two of the condi- 
tions information had to be obtained through the visual channel but under 
two conditions all of the necessary information was provided through the 
audi tory . 

In accordance with custom, the term attention will be used here to refer 
to what was described in the previous paragraph as task orientation . While 
the latter term refers to the matter of whether conditions of posture and 
conditions within the nervous system are set so that information from a 
particular source can be most readily received, only those aspects which in- 
volve posture can be readily observed. The assumption of many teachers is 
that the postural components are highly correlated with the observable com- 
ponents of attending, no assumption is made that these are correlated with 
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the components that cannot be observed directly. 

Method 

The Learning Task . The experimental arrangements were those de- 
scribed in Chapter 3. The task consisted of the learning of 60 German 
vocabulary words by members of an eight-member group. There were two 
primary conditions for learning: By direct verbal interaction with the 

experimenter and through obersvation or vicarious experience (observation 
of the direct respondent). In the case of direct learning, E held a 
card (11" X 14") providing an item, read the German word from the card, 
and called the name of a subject, who was to respond orally. The S then 
read aloud two possible English equivalents from the card, made a choice 
and received knowledge of results from E. Observational learning 
occurred as the remaining seven subjects observed the interaction of the 
direct respondent and experimenter. 

The procedure followed by the experimenter in presenting the task was 
approximately as follows: E placed one of the cards (11" x 14") on a 

stand positioned on a large table before the Ss. She then addressed a 
subject by name, saying for example, "John, this word is Baum." In re- 
sponse, the subject read both English alternatives as the equivalent of 
Baum from the card and made a choice. The S might have responded "tree," 
the correct choice, or he could have said "corn," the incorrect response. 
Knowledge of results was provided from one of four categories: 

Cond i t i on I 

(nothing said when S was correct) 

"No, that's wrong" 




4.03 



il 



Condition 1 1 

“That's right” 

"That's wrong" 

Condition ill 

"That's right" 

"No, that's wrong, means 

^°'^*^|slot^?ng'^sa id when S was correct) 

"No, that's wrong, means 

The point to note is that the first two forms of feedback require 
the learners to direct their eyes towards the visually presented material, 
while in the case of the last two forms of feedback, the information can 
be obtained entirely by listening. 

Design and Procedures 

Thirty-six eight member groups were involved in the study. The sub- 
jects were the same as those in the previous study. Each group of eight 
was seated in a straight line facing the experimenter who stood about 
seven feet distant from the center of the row. Half of the subjects, the 
odd-numbered ones across the row were designated as direct subjects, that 
is, each one gave an oral response to one of the items. While one respond- 
ed, the remaining three observed and attempted to learn the item. For 
those subjects that responded, one-fourth of the items were learned by 
verbal interaction and three-fourths by observation. Four Ss in the even 
numbered positions made no overt response at any time while the task was 
presented except on an initial practice trial. These Ss were referred to 
as observer subjects or as vicarious Ss. They depended entirely on obser- 
vation for learning. Due to the fact that direct responding required an 
obligatory orientation to the task (the S could not escape looking, reading 



and responding), manifestations of attention could be recorded only when 
a subject was involved as an observer; thus while one was directly in- 
volved, any one of seven Ss could be observed for evidence of relevant 
attention. Seated at the side of the room was an observing experimenter 
who, according to a schedule designating which S was to be observed, made 
the following objective recordings: (a) initial attention ; credited as 

positive if the observed S was looking at the card for the entire interval 
beginning when E called the name of a direct subject until she had read 
the German word ( a rapid glance away but returning again did not negate 
the recording), (b) concluding attention ; was recorded positive if the 
S looked continuously from the time the direct S spoke his choice of re- 
sponse until E put the card down (a total of about five seconds). The 
objective assesment of attention was recorded on an observation schedule 
which also include the information as to the correctness of the direct 
S's response to each item each time it was presented. 

At the conclusion of each days trials ^ the experimenter who had admi 
istered the task recorded her subjective estimation of which two of the 
eight Ss in the group had attended best and which two had attended least. 

Learning trials occurred on three days: Monday, Tuesday, and Wednes- 

day. On each of these days, 20 items were introduced with four trials on 
each. The four presentations of an item were programmed intermittently 
throughout the total of 80 item presentations per day so that it could not 
be anticipated which item would occur next in the series. 

Test of Learning . On a separate Friday for each experimental repli- 
cation, a four-choice recognition test was given to that week's subjects. 
Included in the four choices were the right response, the wrong one which 
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had appeared on the card during the trials, and two distractors selected 
from a total pool of response words. 

Sub iects . Subjects were children in the fourth, fifth, and sixth 
grades. Each group replication involved only children from a single 
grade. 

Not all of the subjects included in the study reported in Chapter 3 
were included in the present study. The reason for this is that the pro- 
cedures for recording attention data had to be modified as experience in 
the early part of the data-col lection procedure demonstrated inadequacies 
in them. The data available thus shows differences in the number of cases 
involved from one learning conditions to another and from one attention- 
measuring procedure to another. 



Results 

The extent to which attention measures agree is provided in Table 
4.01. When the four learning conditions were combined, the correlation 
was .50 (N=153). When one considers the fact that the two measures of 
attention were produced by independent observers, the reliability of the 
measures is surprisingly good. 

Evidence is provided in Figure 4.01 to show that visual orientation 
to the task relevant stimuli declined with the passing of time. It is 
obvious that the decrease was greater within days than it was between. 

Of particular interest is the fact that the gradient representing declin- 
ing visual "attention" for Monday was not as steep, nor does it dip so 
low as on Tuesday and Wednesday. 
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Table 4.01 



B I SERIAL CORRELATIONS BETWEEN AN OBJECTIVE MEASURE OF 
ATTENTION AND AN IMPRESSIONISTIC MEASURE OF ATTENTION 



Learning Condition 



N r p 



I 38 .74 <01 

1 

“No, that's wrong" 

II "That's right" 39 *39 <01 

"That's wrong" 

Ml "That's right" 30 .45 <01 

"No, that's wrong, means ." 

1 ^ 46 .38 <01 

"No, that's wrong, ^means ." 
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The visual attention decrement occurr i nq within a single day cannot 
be directly shown with the present data to affect retention of the task. 
This is due to the fact that the twenty items, constituting one day's 
learning task, were repeated four times through the course of the trials. 

A decrement in attention could not be reflected in a reduction of measured 
learning since a presentation near the end, say at the 70th or at the 80th 
trial was a word already encountered three times previously in the ac- 
quisition series for that day. 

Relationship of "Attention" to Learning . Individuals learning scores 
separated according to feedback condition, were correlated with the ob- 
jectively measured attention values. For this purpose the two objective 
measures of attention were combined. The correlations are given in Table 
4.02. Of interest to note is the fact that the objective measure of at- 
tention was correlated positively with achievement only in the case of 
the first two learning conditions which required that the subjects obtain 
information via the visual channel if learning was to occur with maximum 
effectiveness. The negative correlation in the case of Condition 3 is 
not readily understood. 

In Table 4.03 the same relationships shown in Table 4.02 are given 
but with the subjects classified separately into those who responded part 
of the time to the experimenter's questions and those subjects who did 
not. These correlations lack sufficient stability for any pattern to be 
evident.^ 

A second, different type of attention measure had been obtained which 
depended upon the experimenter's general impression of who was attending 
and who was not. Correlations were computed between the amount of learning 
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Table 4,02 

CORRELATIONS BETWEEN OBJECTIVELY MEASURED ATTENTION AND 



LEARNING UNDER EACH 


OF FOUR 


KNOWLEDGE- 


OF-RESULTS 


CONDITIONS 




Learning Condition 






N 


r 


t 


P 


1 






38 


.28 


1.75 


.05 


"No, that ' s wrong" 










1 1 "That ' s right" 






39 


.21 


1 .31 


JO 


"That's wrong" 














1 1 1 "That ' s right" 






37 


-.22 


1 .38 


.10 


"No, that's wrong, 


jneans 


• 










IV 






46 


-.03 


0.20 




"No, that's wrong, 


jneans 


• 
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Table 4.03 



CORRELATIONS BETWEEN OBJECTIVELY MEASURED ATTENTION AND 
LEARNING UNDER EACH KNOWLEDGE-OF-RESULTS CONDITIONS, 
SUBJECTS DIVIDED ACCORDING TO WHETHER SUBJECTS 
WERE OR WERE NOT RESPONDING PART OF THE TIME 









Subjects Resp- 
onding 1/4 the 
time 


Subjects Not 
Respofiding at 
any time 


1 






.67’*-’' (N«20) 


.28 (N=I8) 




"No, that 's wrong" 








1 1 


"That's right" 




.09'TN=19) 


.37* (N=20) 




"That's wrong" 








1 1 1 


"That's right" 




-.12 (N=19) 


-.29 (N=18) 




"No, that 's wrong, 


means ." 






IV 






.20 (N*24) 


-.08 (N=22) 




"No, that's wrong. 


means ." 







* Significant at .05 level 
** Significant at .01 level 



and the experimenter's impressionistic judgement of which Ss are shown 
in Table 4,04, 

These ratings were produced by E at the conclusion of each learning 
session and without communication with the observing experimenter who 
made the objective recordings of visual attention. Again, as with the 
objective measures, the auditory feedback conditions which provided 
larger amounts of information show no correlation of attention with amount 
learned. Where so little auditory information was given that visual at- 
tention was essential to learning, there did exist substantial correla- 
tions (conditions 1 and 2). 



Pi scuss i on 

While the study presented in this chapter is peripheral to the main 
sequence of studies, it is reported because it deals with a problem of 
central importance to classroom management. Teachers want pupils to 
"attend", particularly during recitation sessions and the common criterion 
of attention is whether the gaze of the pupil is directed towards the 
teacher or towards the discussant or towards whatever is the central 
source of information. The data reported indicate that this criterion 
of attention may have some limited utility when the source of information 
requires the intake of visual information. However, when the source of 
information is auditory, or the transmission of visual information is re- 
dundant or unnecessary, then this criterion of attention has no value. 

The emphasis which has been placed on the position of the eyes in determ- 
ining whether a child is or is not attending appears to be unrealistic. 

Rather than being preoccupied with the direction of the eyes, the 
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Table 4.04 

B I SERIALS CORRELATIONS BETWEEN IMPRESSIONISTIC JUDGMENTS 
OF ATTENTION AND LEARNING UNDER EACH OF FOUR 
KNOWLEDGE-OF-RESULTS CONDITIONS 



Learning Condition 


N 


r 


p 


1 

"No, that's wrong." 


52 


.59 


<.01 


1 1 "That ' s right" 

"No, that's wrong." 


60 


.40 


.^.01 


1 1 1 "That ' s right" 

"No, that's wrong, means " 


60 


.03 




IV 

"No, that's wrong, means " 


68 


.00 
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teacher should be concerned with three aspects of processes related to 
attention. First, there is the matter of whether the pupil can, under 
the given circumstance receive the message that is being transmitted to 
him. Except for the case of the transmission of visual information, the 
teacher can do little to determine whether the information is or is not 
being received, but he can make sure that there are no obvious inter- 
fering circumstances such as competing sources of information, noise, 
and so forth. Second, the teacher should be concerned with whether the 
level of arousal of the learner is sufficient to produce effective learn- 
ing. The only objective criterion is whether the pupil is asleep or 
awake, but conditions can be arranged in the classroom so that there is 
sufficient activity to ensure an adequate level of arousal. Third, the 
teacher must be concerned with the internal condition of the pupil and 
whether it is such that he is utilizing and storing the information pro- 
vided. This can only be determined by checking on the pupil and determ- 
ining how well his information-using processes have been working. 



CHAPTER V 



EFFECTIVENESS OF PUPILS AS REINFORCING AGENTS 
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The Problem 

Teachers in most elementary school classrooms have paired pupils 
for various activities in their daily instructional experiences. This 
study was an attempt to investigate this classroom procedure as a method 

of instruction. 

This was a study of verbal learning and reinforcement in a simu- 
lated classroom situation with four different learning conditions 
involving the interaction of pupils one with another and the interaction 
of individual pupils with the task alone. 

The four conditions of learning established for this study were 
described as follows: Conditions I, M and III all involved pairs of 

pupils working together on the task. Condition IV involved the pupil 
working with the task alone. 

Condition I and II occurred in the same set of pupil pairs. One 
member of the pair (Condition I) assumed the teacher's role and retained 
that role throughout the learning task. The other member of the pair 
(Condition II) assumed the pupil role also throughout the learning task. 

Condition III also involved a treatment where pairs of pupils 
worked together. One pupil assumed the pupil role and one pupil the 
teacher rolej these roles were reversed at the mid-point of each task. 

Condition IV was an isolation type of treatment in which the 
subjects did not interact verbally in pairs as in all other conditions. 

In this condition the subject worked with the task as a self-instruction 
device which the subject used and controlled by himself. 

The interaction of pairs using the task and the interaction of 
pupils with the task alone were completed without reinforcement procedure 
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by the experimenter. Reinforcement came from the pupils within the 
pairs and from the task. 

General Statement of Purpose 

The intent of the present study was to answer certain questions 
regarding the effectiveness of learning under conditions of indivi- 
dualized instruction in which pupils teach pupils and in which pupils 
work alone. 

The study was undertaken in a setting which simulated a conven- 

* 

tional classroom. The teacher-experimenter was not given a role of 
teaching, but served instead as an observer, recorder, and as a resource 
person for questions about procedure. Pupils assumed the role of teacher 
or pupil and worked directly with each other in a one-to-one relation- 
ship. The classroom became a complex of pupil-pupil interaction. 

In addition to the investigation of the methodology above, this 
study was a follow-up of studies involving teacher-pupil interaction, 
both direct and vicarious, described in previous chapters. The study 
used the same task. Comparative data will be presented in a later 
chapter. 

This study attempted to answer the following: 

1. Does reinforcement of correct responses to a given rote 
memory task by a fellow pupil produce learning comparable to that pro- 
duced by teacher reinforcement? In what measurable amounts does this 
learning occur? and how well is it retained over a brief period of 
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2. What learning takes place in pupils acting as the teacher- 
reinforcer? Does the fact that they give the reinforcement rather than 
receive it make a difference in learning this task? 

3 . If the roles of a pupil-pair, those of pupil and teacher, 
are reversed for one-half the task, will the resulting change in role 
produce better rote learning? 

4. What relationships produces the most effective learning for 

both members of the interacting pairs? 

5 . What learning results when the pupil reacts only to the rote 
learning task without interacting with another person? (As in the 
situation which takes place with a teaching machine.) 

6 . Is the social interaction reinforcement mode more effective 
than reinforcement provided directly by the task materials? 

7 . How much time is required to complete the learning task in 
each condition? Is one mode of learning more efficient than the others? 

8 . Does age (grade level) affect this kind of learning? If so, 
in what ways? 



Procedures 

The establishment of procedures used in selecting and identi- 
fying subjects, a description of the task, procedures used by the exper i 
menter in each learning condition, and methods of collecting data are 
presented in this chapter. 

Subjects . Subjects were fourth, fifth and sixth graders in the 
public schools of Salt Lake City. A total of 208 subjects was utilized 
with data from 192 included in the final report of the study. The 
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additional two groups of eight subjects were used to supplement groups 
where absenteeism did not allow one subject to complete all phases of 
the learning experience. 

Task difficulty did not allow experimentation at a much lower 
grade level. It was experimentally important that the subject be naive 
in the task he was to learn, but capable of learning the materials 
involved. 

A reading test score was used to equate data groups for each condi- 
tion in the learning experience. A total division Into twenty-four sets 
of eight subjects each was made before the study began (see Table 5.01). 
Groups were equated, as nearly as was possible, by drawing subjects 
with scores from above and below the median reading score of that group 
and placing them in equal numbers In each condition to be studied. 

Pupils with German language background or previous German instruction 
were not used In the study. 

The Task to be Learned 

The learning tasks consisted of sixty German stimulus words which 
were to be correctly associated with one of two English responses. The 
correct response appeared on one side of a card which had the stimulus 
word plus the two possible responses on the opposite side. The task 
is the same as that described In the introduction to this report but 
it involved the following reinforcing conditions: 

Condition I & 1 1 . These conditions of learning involved the pairs 
of subjects whose roles remained constant in the study. Condition I 
was the pupil acting as the teacher who presented a card showing the 
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TABLE 5.01 



READING SCORE PLACEMENT VALUES UPON WHICH 
PUPIL-SETS WERE ESTABLISHED 







Grade 


Four 


Grade 


Five 


Grade 


Six 






SETS 
















1 


2 


3 


4 


5 


6 






METROPOLITAN 


READING 


SECTION 


TEST 




Condition 1 


1 


4.9 


3.9 


5.7 


3.5 


7.9 


6.1 




2 


7.2 


6.1 


5.6 


10.0 


2.7 


6:1 




3 


5.3 


3.6 


5.1 


7.3 


3.2 


6 .8 




4. 


5.1 


6.3 


4.9 


3.5 


2.7 


7.2 




5. 


4.2 


2.5 


5.9 


7.3 


7.7 


3.9 




6 


4.4 


3.3 


6.3 


4.8 


6.8 


4.0 




7 


4.9 


7.2 


6 .6 


7.7 


3.6 


4.4 




8 


7.9 


7.2 


6.^6 


4.2 


3.9 


4.9 




Total 


43.9 


40.1 


46.7 


48.3 


38.5 


43 .4 


Condition II 


1 


3.9 


3.8 


5.3 


4.4 


7.9 


6.1 




2 


5.7 


6.8 


4.9 


9.2 


3.9 


6 .3 




3 


5.3 


3.3 


5.1 


10.0 


3.2 


3.9 




4 


7.9 


6.3 


4.9 


3.0 


2.3 


7.2 




5 


4.2 


4.0 


6 . 1 


7.7 


6.3 


4.4 




6 


4.2 


4.0 


6 . 1 


4.5 


6.3 


4.2 




7 


4.5 


6.8 


6 .6 


9.2 


3.8 


4.5 




8 


6.3 


7.9 


6 .6 


3.7 


3.4 


5.1 




Total 


43.0 


42.9 


45.6 


51.7 


_1ZJ 


41 .7 


Condition 1 1 1 


1 


3.9 


4.4 


6.8 


5.3 


5.7 


5.1 




2 


4.0 


5.1 


7.1 


5.1 


5.5 


5.7 




3 


6.3 


4.4 


4.9 


5.1 


5.7 


6.1 




4 


6.1 


4.7 


4.5 


4.7 


5.5 


7.9 




5 


3.9 


4.7 


7.3 


5.3 


4.9 


6.3 




6 


4.0 


5.3 


7.7 


4.9 


4.3 


7.2 




7 


6.1 


5.5 


4.7 


4.9 


4.7 


7.7 




8 


7.2 


6.3 


^■7 


_Jl5 


4.2 







Total 


41 .5 


40.4 


_JtZJ 


39.8 


40.5 


53.9 


Condition IV 


1 


3.7 


4.0 


8.0 


5 J 


3.8 


7.9 




2 


3.8 


4.0 


8.0 


5.5 


4.5 


7.9 




3 


4.9 


5.1 


8.4 


5.5 


4.9 


3.2 




4 


4.7 


4.0 


8.4 


6.1 


5.1 


3.8 




5 


7.2 


4.0 


3.8 


6.3 


6.3 


7.9 




6 


7.7 


4.2 


4.0 


6.6 


5.3 


3.5 




7 


7.2 


4.2 


4.2 


6 .6 


5.5 


7.9 




8 


3.4 


4.3 


4.2 


7.3 


-U 


3.8 




Total 


42.6 


33.8 


49.0 


49.6 


41 .1 


45.9 



m mmMf 



mm 



stimulus word to the subject acting as the pupil (Condition II). The 
pupil selected one of the response words and the pup i 1-teacher stated, 

"the right answer is This statement was made in all 

cases regardless of the correct or incorrect responses of the subject. 

Cond i t i on III . The subjects followed the same procedures here 
as in Condition I & II but reversed roles after the daily study was 
one-half completed. 

Cond i t i on IV . In this condition the subjects did not receive feed- 
back from a "teacher" but they did receive feedback from the cards 
having the stimulus word and responses on one side and the statement 

"the right answer is " on the reverse side. These subjects worked 

with the cards. They sub-vocal ly made a response to the stimulus word 

and then reversed the card to read the reinforcing statement. A repre- 

sentation of the four conditions is found in Figure 5.01. 

The task was presented on three-by-five cards. The cards used 
in Conditions I, II and III were identical on both sides except that 
on the teachers side the correct answer was marked with an asterisk 
(Figure 5^02) . 

The cards used in Condition IV were identical to those in Condition 
I and had the reinforcing statement on the opposite side (Figure 5.03). 

Data was gathered with one testing device, a multiple choice test 
previously described through which the material for each day was tested. 
The test was administered on Friday to the groups completing the learn- 
ing task that week. The same test was used to provide retention data 

on a re-test of subjects. This re-test was given to all subjects on 
the same day and gave a retention score with delays of nine, eight. 
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Condition 2 



Condition 3 



Condition 4 
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Teacher role throughout 



Pupil role throughout 



Teacher and pupl 1 rd>les 
reversed In the middle 
d>f the task 



i t . . . 



Self Instruction role 






Fig. 5.01 Four conditions of learning. 
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Fig. 5.02 Card used in conditions S and II. 



The rs^h-t answer Is 



Rei nforcement 
si de 




Fig. 5.03 Card used in Condition IV 

seven, five, four and three weeks for eight subjects in each of the 
four conditions. 

Only two responses were given so that immediate knowledge of 
correct or incorrect response was constant for all subjects. 

Design 

A simulated classroom situation was the setting for all experi~ 
mentation. A teacher-experimenter was In the room as an observer and 
final authority or resource person, but was not actively engaged In 
teaching situations found in the four learning conditions. The teacher- 
experimenter gave instructions on how the experiment would be conducted 
and provided for all physical equipment necessary. 

Subjects were paired by reading score data and four pairs met and 
worked together to learn the task. Subjects within the pairs were 
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assigned a teacher _or pupil role which was maintained throughout the 
experiment. Other pairs reversed the pupil and teacher roles after 
one-half of each day's material was completed. 

Eight other subjects met with the teacher-experimenter and worked 
individually with the task on flashcards. The reinforcement statement 
was printed on the back of each card. 

Twenty items of the task were studied on each of three days, 

Monday, Tuesday and Wednesday with testing being completed on Friday. 

Teacher-Experimenter Procedures 

The teacher-experimenter established the classroom procedures to 
be used in each of the four conditions. Seating arrangements were 
made to suit the conditions with subjects facing one another in Condi- 
tion I, II and III, and subjects seated separately in Condition iV. 

General instructions to the subjects were standard for all condi- 
tions. Special instructions were also given establishing the particular 
condit i on. 

Standard Instructions . The experimenter directed subjects to 
their proper seats and then stated, “You are here to learn some German 
words and what they mean in English. You will work with these cards 
which have the German word and two English words on them. The cards 
are in the order you are to use them. You will go through the cards 
twice. My helper (second experimenter) will work with me with these 
four cards and we will show you just how you are to use the cards to 
learn the meanings of these words. You will then go through four trial 
cards to be sure you understand the instructions. It is important that 
you watch me carefully as we do this, so that you can do it the same way. 



5.12 



I 



Cond 1 1 i on I and I I . The experimenter stated, “You four children 
on this side of the desks will be the teachers in this group. You four 
on the other side are pupils. The ones serving as the teacher will 
show the card to the pupils. Like this (Experimenter will demonstrate 
by presenting the first card to his helper). The teacher will say the 
German word. The pupil will also say the German word and the two 
English words. The pupil will then choose one of the English words as 
the answer and say it, (experimenters demonstrate) and then the teacher 

will say the right answer is , and go to the next card. You will 

go through this stack of cards twice. When you finish, raise your hand. 
Are there any questions?” (Questions were answered) “You may now go 
through the four trial cards on the table. We will watch you and help 
you if you need help.” 

Condition Ml . The experimenter gave the same instructions as in 
Conditions I and II but gave instructions that the teacher and the pupil 
would change roles at the half-way mark before going through the stack 
the second time. 

Condition IV . In addition to the standard instructions, the experi- 
menter demonstrated the method used in this condition. The experimenter 
picked up a sample card prepared for this condition and explained to 
the subject, “You are to look at the card and in your mind choose one 
of the answers. They you will turn the card over like this and you 
will read to yourself, silently, the right answer i s You can 

see that you have the right answer on every card. Check your choice 
against this answer. Now you may go through these four trial cards." 
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A try-out of the design using two subjects in each of the condi- 
tions was completed to test the instructions and the design before the 
data for the study were collectedo No significant changes in procedure 
were made after the try-out. 



RESULTS 

The data for this study was collected fromfbur sources. The 
first two sources were scores from immediate and delayed retention 
tests of the task. The third source was a time record on each subject 
indicating the time taken to complete each day’s task and the fourth 
source was the observed behavior of the subjects recorded by the experi 
menter in anecdotal and tabulated form. Results from these sources 
are reported in the following sections of this chapter; (1) the imme- 
diate Retention Test Data; (2) the Delayed Retention Test Data; (3) the 
Time Record Data. 



immediate Retention Test Data 

Data was gathered on each of the 192 subjects involved in this 
study with a test of the learning task. The test was given on Friday 
to subjects who had worked with the learning tasks that week. 

The means and standard deviations were computed for each of the 
sets of subjects at each grade level in each condition and for each of 
the three tasks. Tables 5»02 and 5«03 were constructed to show these 
data. The means for Task A were higher than those of Task B and C. 

The means for Condition I, teacher role only, were generally lower than 
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TABLE 5.02 

MEANS AND STANDARD DEVIATIONS 
OF IMMEDIATE RETENTION TEST SCORES FOR GRADE 
LEVELS, CONDITIONS, AND TASKS 



Learni ng 
Condition 


Grade 


Task 

M 


A 

SD 


Task 

M 


B 

SD 


Task 

M 


C 

SD 


1 


4 


8.88 


3.02 


8.31 


3.08 


7.88 


2 . si 




5 


11 .69 


4.07 


8.13 


3.82 


7.00 


2.96 




6 


10.19 


3.17 


7.19 


1 .98 


6.81 


3.03 


1 1 


4 


12.31 


3.16 


8.63 


2.67 


9.88 


2.87 




5 


12.94 


2.82 


9.94 


2.49 


8.94 


2.75 




6 


12.25 


2.88 


9.13 


2.62 


9.31 


2.67 


1 1 1 


4 


13.19 


3.40 


9.31 


3.02 


8.19 


3.15 




5 


11 .94 


3.56 


8.56 


2.69 


7.88 


3.35 




6 


9.88 


3.33 


8.13 


2.50 


7.94 


2.77 


IV 


4 


12.06 


2.30 


8.50 


2.57 


7.44 


1 .97 




5 


12.75 


3.91 


8.19 


2.27 


7.81 


3.26 




6 


14.75 


2.77 


9.63 


2.29 


6.94 


2.84 



o 
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TABLE 5.03 

MEANS AND STANDARD DEVIATIONS FOR CONDITIONS BY TASKS 



Lea rni no 


A 




B 




C 




■ i • ■ i ^ 

rnnd i t i nns 


M 


SD 


M 


SD 


M 


SD 




1 (Teacher. Role Only) 


10.25 


3.64 


7.88 


3.09 


7.23 


3.01 


1 1 (Pupi 1 Role Only) 


12.50 


2.97 


9.23 


2.65 


9.38 


2.80 


III (Teacher and Plipil) 


11 .67 


3.69 


8.67 


2.78 


8.00 


3.10 


IV (Self-Instruction) 


13.19 


3.28 


8.77 


2.46 


7.40 


3.12 
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for all other conditions. Means for each grade level did not appear to 
vary very much. 

Means in Table 5*03 show that there are differences between condi- 
tions and even greater differences across Tasks A, B and C in each 
condition. 

In order to examine these differences the analysis of variance 
shown in Table 5.04 was computed with three main effects; conditions, 
tasks (days), and grade levels. 

Main Effects 

The significant main effects of this study were Conditions (p<.01) 
and Tasks (p<.001). Grade level differences were not significant. 

In order to examine the components of the variance of the third 
main effect Conditions, a comparison was made of each condition with 
each one of the other three. 

These conditions were previously defined but for convenience to 
the reader, the following brief description was re-stated. 

Condition I represented that treatment wherein one subject in a 
pair assumed the role of teacher and maintained that role throughout 
the experiment. 

Condition II represented that treatment wherein one subject in a 
pair assumed the role of pupil and maintained that role throughout 
the experiment. 

Condition Ml represented that treatment wherein both subjects in 
a pair were acting as teacher or pupil for one-half of the task 
and reversed their roles at the half-way point. 
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TABLE 5.04 

ANALYSIS OF VARIANCE FOR CONDITIONS, GRADE LEVELS, AND LEARNING 
TASKS — IMMEDIATE RETENTION TEST DATA 



Source of Variation 


Sum of Squares 


df 


M Square 


F 


P 


Between Ss 


3393 


191 


17.76 






Grades 


9 


2 


4.50 


0,27 




Condi t ions 


279 


3 


93.00 


5.58 


.01 


Grades X Conditions 


. 106 


6 


17.67 


1 .06 




Residual Between 


2999 


180 


16067 






Within Ss 


4138 


384 


10.78 






Tasks (Days) 


1682 


2 


841 .00 


141 .58 


.00 


Tasks X Grades 


23 


4 


5.75 


0.97 




Tasks X Conditions 


134 


6 


22.33 


3.76 


.00 


Tasks X Grades X Conditions 


162 


12 


13.50 


2.27 


.01 


Residual Within 


2137 


360 


5.94 






Total 


7531 


575 
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Condition IV represented that treatment wherein the subjects 
used a sel f- teach I ng device to learn the task and were not paired 
but worked alone. 

This main effect produced by these conditions was also examined 
statistically with an Individual degrees of freedom test which compared 
the Immediate retention test scores for earn condition. Table 5*05 
gives Information on these comparisons. 

This additional analysis provided evidence that all other condi- 
tions were superior to Condition !. The comparison of Condition II 
and IV, the two variables with the highest mean scores with Condition III 
and I, the two variables with the lowest mean scores, showed a superiority 
In favor of Conditions Jl and IV at the .001 level of confidence. 

When treated separately, learning In Condition III, teacher and 
pupil role, was superior to that In Condition I, teacher role only, and 
therefore Conditions II and IV were also significantly superior to 
Condition I, In that, the mean scores of these two conditions were even 
higher than the mean scores of Condition III. 

In comparing Conditions II and IV, the two conditions with highest 
scores. It was found that Condition II, pupil role only, was signifi- 
cantly superior to Condition IV at the .05 level of confidence. 

Although mean score comparison favored Condition IV, self- Instruct Ion , 
over Condition III, the difference did not reach a level of significance. 
Condition II was significantly superior to Condition 1 1 1 at .001 level 
of confidence. These two comparisons were not orthogonal, but the 
superiority was apparent from the total scores which showed Condition 




II to be superior. . 






TABLE 5.05 

INDIVIDUAL DEGREES OF FREEDOM TEST COMPARING 
IMMEDIATE RETENTION TEST SCORES OF 
THE FOUR CONDITIONS 



Source of 


Variation 


df 


Ms 


F 


P 


Condition 


1 1 & IV vs. Condo 1 1 1 & 1 


1 


550,13 


33.02 


O 

O 

O 


Condi t ion 


II vs . Condi tion III 


1 


184.26 


11 .06 


.001 


Condi t ion 


1 1 vs Condi tion IV 


1 


73 .50 


4.47 


.05 


Condi t ion 


III vs . Condi tion 1 


1 


213 .01 


12.79 


.001 


Condi t i on 


IV vs 0 Condition 1 1 1 


1 


25.01 


1 ,50 ' 





Note: All computations based on 180 degrees of freedom because 180 were 

involved in the error term from Table 5 .04 
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Interactions 

Two interactions in the analysis of variance reached levels of 
significance as was shown in Table 5»02. One double interaction, 

Task X Condition, was significant* This double interaction was plotted 
graphically in Graph 5.01. This graph was derived from the mean scores 
making up this double interaction and were shown in Table 5.03. Graph 
5.01 showed that Conditions I and IH had a similar learning decrement 
across the three tasks, while Condition M and IV varied somewhat across 
the three tasks. Condition II showed a sharp decrement in learning 
from Task A to Task B, but showed a slight increment in learning in 
Task C. Condition IV, on the other hand, showed a very sharp decrement 
from Task A to Task B and this decrement continued to Task C at a much 
steeper decline than in any of the other conditions. The differences 
in decrement between conditions, across the three tasks appeared to be 
one reason for this interaction being significant. In order to present 
evidence of this decrement the difference scores between tasks for each 
condition were computed and were shown in Table 5.06. 

The difference scores in Table 5«06 indicated the total decrement 
in correct responses from one day's task to the next. The difference 
scores demonstrated that the decrement from Task A to B was much greater 
than from Task B to C. 

These difference scores were compared using an a posteriori test, 

i 

Scheffe's Test for Multiple Comparisons . (Edwards, 1962, p. 15^), to 
determine if the differences varied significantly from one condition 
to any other. Table 5.07 was constructed to show the results of these 
tests and the resultant significant comparisons. 
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Fig. 5.04 Mean scores demonstrating double interaction of task 
X condition on immediate retention test data in Table 5«04. 
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TABLE 5 .06 

THE DIFFERENCE BETWEEN TOTAL SCORES OF TASKS A AND B, 
AND B AND C FOR EACH OF FOUR CONDITIONS 
OF IMMEDIATE RETENTION TEST 





Difference 


Between 




A - B 


B - C 


Condition 1 
(Teacher Role Only) 


114 


31 


Condition 1 1 
^Pupi 1 Role Only) 


157 


-7 


Condi t ion III 
(Teacher and Pupil Role) 


144 


32 


Condition IV 
(Self-Instruction) 


212 


66 



o 
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TABLE 5.07 

I 

SCHEFFE'S TEST FOR MULTIPLE COMPARISONS USING 
DIFFERENCE SCORES FROM TABLE 5 .06 TO 
COMPARE VARIATION BETWEEN CONDITIONS 



Source of Variation 


df 


Ms 


F 


P 


Condition IV vs. Ill from 










A to B 

Condition IV vs. II from 


1 


48.16 


8.11 




A to B 

Condition IV vs. 1 from 


1 


31 .52 


5.31 




A to B 

Condition IV va II from 


1 


100.04 


l6o84VwV 


.01 


B to C 


1 


55.50 


9 . 34 * 


.05 


Note: A1 1 terms based on 36 O degrees of 

i n Table 5 .04 „ 

*F' needed at .05 level = 9.12 
'«‘F' needed at .01 level = 14.13 


freedom as 


i n the error term 
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The condition which varied from the others at a level of significance 
was Condition IV where subjects used a self-instruction mode. The decre- 
ment in learning scores from Task A to Task B was significantly greater 
for Condition IV than for Condition I, teacher role only, (p>.01) and 
the decrement from Task B to C showed Condition IV with the greatest 
decrement of all conditions and significantly greater than Condition II, 
pupil role only, (p>.05). This final difference was also effected by 
the increment from Task B to C in Condition II. The difference in decre- 
ment between Condition IV and Condition II from Task A to B was not 
significant, and inasmuch as all other differences were less than this 
difference between Condition IV and II, all other differences were also 
not significant. 

Triple I nteract i on 

The triple interaction in Table 5.04, Tasks x Grades x Conditions 
was significant (p<. 01). The source of this interaction was considered 
to be generally very obscure, and because most authors admit that it 
is next to impossible to fit this kind of interaction into any conceptual 
scheme, no detailed analysis of the triple interactions in this study 
were completed. Although some graphic or geometric presentation is 
suggested by Winer (1962), the complexity of a geometric three dimen- 
sional graph was not practical. 

Delayed Retent i on Test Data 

Three weeks after the final group of thirty-two subjects had com- 
pleted the experiment, a delayed retention test of the tasks was given 
to all 192 subjects. Because of the organization of the public school 
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where the data were collected, it was impractical to have subjects from 
each grade level participate in the learning tasks each week; therefore, 
all subjects learning the task in a given week were from the same grade 
level. Thirty-two subjects, who completed the task each week, were 
divided equally among the four conditions of the experiment. The experi- 
ment was conducted for three consecutive weeks and then there was a 
break of one week to allow for the state education association meeting 
which curtailed school and then the experiment continued for three more 
consecutive weeks. When the delayed retention test was given to all 
subjects on the same day, this provided data which showed periods of 
delay in measuring retention of three, four, five, seven, eight and nine 
weeks. The delayed retention scores were computed for subjects in each 
condition to allow for comparison with the Immediate Test Data. 

An analysis of variance of the data from the delayed retention 
test was computed and is shown in Table 5.08. This analysis omitted 
grade level. 

The variance across Tasks A, B and C was still significant at 
the .001 level of confidence as it had been in the Immediate Test Data. 
Mean scores of the total scores for each day were plotted and the decre- 
ment from Task A to B and from B to C was apparent. The means were 9.56. 
7 . 70 , and 6.29 for Tasks A, B and C, respectively. 

Evidence presented in this analysis indicated that there was superior 
retention of the first day's task over the other two days and that Task 
B or the second day's task was superior to the final task. All of these 
differences were significant at the .001 level of confidence. 
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TABLE 5.08 

SOURCE TABLE FOR ANALYSIS OF VARIANCE OF 
DELAYED RETENTION TEST SCORES FOR THE 
THREE TASKS AND FOUR CONDITIONS 



Source of Variation 


Ss 


df 


Ms 


F 


P 


Between Ss 


2511 


191 








Conditions 


96 


3 


32.00 


2.49 


.10* 


Residual Between Ss 


2415 


188 


12.85 






Within Ss 


2707 


384 








Tasks (Days) 


1033 


2 


516.50 


119.00 


. .001 


Tasks X Conditions 


43 


6 


7.17 


1 .65 




Residual Within Ss 


1631 


376 


4.34 






Tota 1 


5218 


575 








’’?Note; An £ of 2.60 was 


needed at 


the .05 


level . 
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Conditions did not vary as they had on the Immediate test data. 
Analysis of variance indicated significance between the .10 level and 
the .05 level for differences among learning conditions. 

In order to examine the retention curve of the subjects across 
the six delay periods, the mean for each week was computed. The means 
were 24.87, 21.60, 24.30, 24.30, 22,56 and 23^73 for weeks three, four, 
five, seven, eight and nine, respectively. These means have_been 
plotted in Graph 5.02 which shows that the retention curve was basically 
flat. The difference between the highest and lowest means in this 
graph was only 3-27 with all other differences being less tha"* 3 * 27 . 

A table of means was constructed showing the mean scores of each 
group of eight subjects in each condition on both the ' Immed i ate Retention 
Test and the Delayed Retention Test, These means were computed for 
subjects in each condition who were involved in each period of delay, 
thereby giving a comparison of the means for each set of eight subjects 
on the immediate and delayed tests. These means shown in Table 5.09 
were used to develop a set of adjusted means from the means of the 
delayed retention scores. This was accomplished by finding the differ- 
ence between each immediate test mean and the average of the immediate 
test means and using that difference to adjust the delayed means. What 
this does is to correct each mean on the delayed test for the degree 
to which the group showed high scores or low scores on the immediate 
retention test. These adjusted means were then averaged by pairs across 
the six periods of delay to compute the average adjusted means. In 
effect, this was a way of reducing the variability between sets and 
visualizing a truer picture of what the actual retention curve for each 
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Fig. 5.05 Mean scores for delayed retention test for total 
scores for subjects in each of the six periods of delay. 
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TABLE 5.09 

MEAN SCORES FOR THE IMMEDIATE AND DELAYED RETENTION TESTS, THE ADJiiSTED 
DELAYED RETENTION MEANS, AND AVERAGE OF PAIRS OF ADJUSTED 
MEANS FOR EACH CONDITION ACROSS PERIODS OF DELAY 









Weeks 










3 


4 


5 


7 


8 


9 


Condition 1 (Teacher Role) 












Immediate 


25.62 


22.74 


24.70 


29.01 


25.38 


24.60 


Delayed 


22.38 


18.39 


21 .75 


27.00 


19.74 


19o50 


Adjusted Delayed 


22.08 


20.97 


22.47 


23.31 


19.68 


20.22 


Average of each pair 
of adjusted means 


21 .52 


21 .72 


22.89 


2 ! .49 


19.95 




Condition II (Pupil Role) 












Immediate 


32.58 


28.74 


30.63 


33o0C 


31 .62 


30,00 


Delayed 


26.64 


23.64 


21 .87 


26.01 


22.14 


23.37 


Adjusted Delayed 


25.17 


26.01 


22.35 


21 /I2 


21 .63 


23 48 


Average of each pair 
of adjusted means 


25.59 


24.18 


21 .74 


21 .38 


23,05 




Condition III (Teacher & 


Pupi 1) 












Immediate 


26.49 


25.38 


27.63 


29.13 


30o24 


31,14 


De layed 


23.01 


20.49 


24.12 


25.26 


24.87 


28 14 


Adjusted Delayed 


24.87 


23.46 


24.84 


24.48 


22,98 


25,35 


Average of each pair 
of adjusted means 


21 .92 


24.15 


24.60 


23.73 


24.67 




Condition IV (Self-D nstruct ion) 












Immediate 


33.39 


29.25 


31 .50 


26.01 


29.01 


27,00 


Delayed 


26.88 


23.88 


29.49 


19.26 


23 .49 


23,88 


Adjusted Delayed 


22.86 


24.00 


27.36 


22.62 


28,85 


26,25 


Average of each pair 
of adjusted means 


23.43 


25.68 


24.99 


25.74 


27.55 
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condition would be. These delayed retention curves for each condition 
were then plotted and are shown in Graph 5.03 as well as the average 
adjusted means to show the flattened effect on the retention curve. 

Although all four conditions showed a slight low in retention for 
subjects who completed the test after a delay of four weeks, the basic 
adjusted curve was fairly flat with differences between weeks of delay 
consistently small. 

A comparison of the loss of scores from Immediate Retention Test 
to the Delayed Retention Test was also made by computing the total scores 
for each condition on each test and finding the percentage of loss for 
the total group and for the subjects in each condition. 

Table 5.10 was constructed to show this data and this table illus- 
trates that the greatest percentage of loss took place in Condition II, 
pupil role only, with a loss of 23.0 per cent from one retention test 
to the other. The least per cent of loss occurred in Condition III 
with 14.2 per cent; this condition was that one wherein the teacher- 
subject and pupil-subject reversed their roles one-half way through 
each day's learning task. The total loss in correct response scores 
from one test to the other for all 192 subjects was 958 which repre- 
sented a percentage loss of 17.5 per cent. 

Time Record Data 

One of the major sources of data for this study was a time record 
which was made on each subject or pair of subjects as these subjects 
completed each of the three tasks. The time record showed in minutes 
the time taken to complete each of the Tasks A, B and C. Evidence of 
differences by condition, task, and grade level were sought. The means 
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score 

30 

28 



Delayed Retention 
Adjusted Retention 



26 

24 



Condition I 
(Teacher Role) 




345789 weeks 




Fig. 5.06 A comparison of the delayed retention means for each 
of the six periods of delay and the average of pairs of adjusted means 
(from Table 5*09) . 
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FIGURE 5 -OS (Continued) 
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TABLE 5.10 

DIFFERENCE BETWEEN TOTAL SCORES FOR IMMEDIATE AND 
DELAYED RETENTION TEST AND THE PERCENT OF LOSS 



FOR 


CONDITIONS 1,11, 


III AND !V (N 


= 48 FOR EACH 


CONDITION) 




Immediate 


Delayed 


Difference 


Percent of loss 


Condition 1 


1217 


1030 


187 


15.4 


Condi t ion II 


1493 


1149 


344 


23.0 


Condition 1 1 1 


1360 


1167 


193 


14.2 


Condition IV 


1409 


1175 


234 


16.6 


Tota 1 


5479 


4521 


958 


17.5 



I 
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and standard deviations for these data were computed and are shown in 
Tables 5.11 and 5.12. 

Means and standard deviations were computed for each of the sets 
of subjects at each grade level, in each condition and for each of the 
three tasks. Table 5.11 was constructed to show the means across tasks 
and conditions by each grade level. The data indicated that each grade 
level shows a decrement of time required to complete the tasks with less 
time required to complete Tasks B or C than to complete Task A. It 
was also apparent from these mean times that fourth graders needed more 
time to complete the task than fifth graders; and fifth graders required 
more time than sixth graders to finish the tasks. 

In Table 5.12 the mean times for each of the four conditions across 
the three tasks were computed. Conditions I and II represented subjects 
working together so their times were basically identical; the slight 
variations were due to necessary substitution of subjects because of 
absenteeism. 

Condition III, which also represented pupil-pairs working together 
on the Tasks, had very similar mean times to those of Conditions I and 
II. Conditions IV mean times were sharply lower than those of all other 
conditions; in fact, less than half as much time was required by subjects 
in Condition IV to complete the tasks than in any of the other conditions. 

An analysis of variance of the time record data was completed and 
the source table for this analysis is shown in Table 5.13. 
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TABLE 5.11 



MEANS AND STANDARD DEVIATIONS OF TIME 
RECORD DATA FOR GRADE LEVELS, 
CONDITIONS, AND TASKS 



Learni nq 


Grade 




A 


B 


C 




Condition 


M 


SD 


M 


SD 


M 


SD 


1 


4 


16.34 


3.73 


15.45 


3.47 


13.59 


3.25 




5 


15.23 


2.93 


13.54 


1 .82 


13.46 


2.01 




6 


12.85 


1 .05 


11 .02 


1 .41 


10.98 


1 .93 


1 1 


4 


16 .36 


3.72 


15.39 


3.50 


13.41 


3.31 




5 


15.24 


2.93 


13.54 


1 .82 


13.46 


2.01 




6 


12.85 


1 .05 


11 .02 


1 .41 


10.98 


1 .93 


III 


4 


17.55 


3.06 


15.31 


2.86 


13.48 


1 085 




5 


14.69 


1 .63 


14.34 


2.39 


11.16 


2.60 




6 


13.26 


1 .45 


12.46 


1 .57 


11 .23 


1 .32 


IV 


4 


8.04 


2.28 


6.82 


1 .60 


6.15 


1 .20 




5 


6.97 


1 .67 


5.68 


1 .43 


7.60 


0.98 




6 


8.04 


1 .30 


5.46 


1 .02 


4.87 


1.15 
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TABLE 5.12 

MEANS AND STANDARD DEVIATIONS FOR TIME 
RECORD DATA WITHOUT REGARD 
TO GRADE LEVELS 



Task A Task B Task C 

Conditions M SD M SD M SD 



I (Teacher Role Only) 14.81 3*15 13=34 2,99 12.68 2.72 

II (Pupil Role Only) 14.82 3=14 13=31 3=03 12.62 '2.74 

III (Teacher and Pupil) 15=17 2.79 14. 04 2.59 11.96 2.26 

IV (Self-Instruction) 7=68 1.87 5-99 1.48 6.21 1.56 
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TABLE 5.13 

ANALYSIS OF VARIANCE FOR CONDITIONS, GRADE 
LEVELS, AND LEARNING TASKS - SOURCE 
TABLE - TIME RECORD DATA 



Source of Variation 


Ss 


df 


Ms 


F 


P 


Between Ss 


8311 .36 


191 








Grades 


731 .05 


2 


365.53 


30.93 


.001 


Conditions 


5311 .05 


3 


1770.35 


149.78 


.001 


Grades by Conditions 


141 .75 


6 


23.63 


2.00 




Residual Between Ss 


2127.51 


180 


11 .82 






Within Ss 


1469.84 


384 








Tasks (Days) 


501 .38 


2 


250 .69 


119.95 


.001 


Tasks by Grades 


40.86 


4 


10.22 


4.89 


.001 


Tasks by Conditions 


70.39 


6 


1 1 .73 


5.61 


.001 


Tasks by Grade by Conditions 


101 .09 


12 


8.43 


4.03 


.001 


Residual Within Ss 


753.52 


360 


2.09 






Tota 1 


9778.60 


575 












Main Effects 



All the main effects of the time data v\®re significant -- grade 
level (p<.001), tasks (p<.001) , and conditions (p<.001) (Table 5.13). 

The time means for grade levels were 39.^7, 36.23, and 31.25 for 
the fourth, fifth and sixth grades, respectively. The latter variance 
was further tested statistically by an individual degrees of freedom 
test to determine which grade levels were varying significantly. The 
results of this degrees of freedom test are shown in Table 5.1^. 

The results shown in Table 5.1^ demonstrated that the sixth grade 
subjects took significantly less time to complete the tasks in this 
study than did the fourth and fifth grade subjects. The fifth grade 
subjects also took significantly less time to complete the tasks than 
did the fourth grade subjects. The difference in time taken to complete 
the tasks varied significantly from grade level to grade level with 
the fourth grade subjects taking the longest time, the fifth grade sub- 
jects requiring significantly less time than the fourth grade subjects 
and the sixth grade subjects taking significant ly less time than the fifth 
grade subjects to complete the tasks. 

The second significant main effect in the analysis of variance 
was the difference between time taken to complete each of the three 
tasks. The mean times for subjects to complete Tasks A, B and C were 
13 . 12 , 11.67 and 10 . 86 , respectively. Task A completed on Monday required 
the longest time to complete. The Tasks B and C which were completed 
on Tuesday and Wednesday each required subjects successively less time 
to complete. 
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TABLE 5.14 

INDIVIDUAL DEGREES OF FREEDOM TEST 
COMPARING TIME DATA OF THE 





THREE GRADE 


LEVELS 






Source of Variation 


df 


Ms 


F 


P 


Grade 4 vs. the average 


of Grades 5 and 6 


1 


350.56 


29.66 


.001 


Grade 5 vs. 6 


1 


790.53 


66.88 


.001 


Note: All terms based on 

involved in the error term 


180 degrees of 
in Table 5.13 


freedom since 

« 


180 ^ were 





o 
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The final main effect, Conditions, also showed a significant 
variance in the analysis of variance (Table 5.13). However, in this 
main effect three conditions had mean times which varied only slightly 
while the fourth condition had an unusual variation from the other 
three. Conditions I and 1 1 , of course, involved the same pairs and 
the time score means in these two conditions would have been identical 
except for a substitute subject being used to complete the data in 
Condition I because of an absentee. Condition III, which also was a 
condition involving pairs of subjects, varied only slightly from Condi- 
tions I and II. The mean times for the four conditions, I, II, III and 
IV were 40.82, 40.75, 4l.l6 and 19.88, respectively. 

The lack of variatjon between the time needed to complete the task 
for the conditions involving pupil-pairs was evident also in an indivi- 
dual degrees of freedom test comparing the time data of the four condi- 
tions. No significant differences were found between Conditions I, II 
and III, but Condition IV varied significantly from all other conditions. 

By testing the variance between Conditions I and III, it was demon- 
strated that no significant difference was required in the time to 
complete the tasks when pupil-pairs worked on the task together. Condi- 
tions I, II and III, theVaired conditions, did not vary significantly, 
although subjects in Condition III required slightly more time to complete 
the task. However, Condition IV, self-instruction, varied very signifi- 
cantly from the other conditions in that subjects in Condition IV 
required less than half the time to complete the tasks than those sub- 
jects completing the tasks in Conditions I, II and III. The level of 
significance for this difference was .001. Inasmuch as Condition II, 
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pupil role only, had the lowest mean time of Conditions I, 01 and 111 , 
it was only necessary to compare this condition with Condition SV, self- 
instruction, in order to establish the significance of variation of 
Condition IV to all other conditions. 

Interactions 

The interactions which had Tasks as one of the interaction com- 
ponents were all significant in the analysis of variance in Table 5.13» 
The double interactions Tasks x Grades and Tasks x Conditions were 
examined graphically to show how the components of these interactions 
might have varied from one another enough to become significant. A 
graphic presentation was used because the variation was discernible in 
this type of presentation. If the interaction were zero, the components 
in a geometric or graphic presentation would be parallel even though 
of different strength numerically. The variation created by the inter- 
action would force the graphic presentation away from this parallel 
graphic presentation (Winer, 1962). 

First, the interaction Tasks x Grades was plotted using mean scores 
from Table 5.11 as a basis for the graph. Graph 5.04 was constructed 
to illustrate this interaction. The time decrement for the fourth and 
fifth grade subjects from Task A to B followed a parallel or nearly 
parallel form and subjects from grades five and six followed a parallel 
time decrement from Task B to Task C. Two lines in the graph are not 
in parallel form: (1) Sixth grade subjects showed a sharper decrement 

in time required to complete the task from Task A to Bj (2) Fourth 
grade subjects did not show a leveling off of time from Task B to C as 
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did subjects in grades five and six. The decrement continued down at 
a sharper angle for fourth grade subjects. 

The second double interaction from the analysis of Time Record 
Data which was significant was Tasks x Conditions. This interaction 
was also plotted graphically using means from Table 5.12. Graph 5.05 
was constructed to show this interaction. It is difficult from this 
graph to make any concise statement about the nature of the interaction. 

Efficiency of Learni ng 

One of the basic reasons for recording time data was to attempt 

to develop an efficiency of learning score. This was done for each 

condition and each grade level by developing the formula ES = (ES = 

CR 

Efficiency Score, TT = Total Time, CR = Correct Response on Immediate 
Retention Test). The fewer minutes required to learn a correct response 
the more efficient the learning was considered to be. 

First, the formula was applied to each of the four conditions and 
the results were recorded in Table 5.15. 

Condition IV, self-instruction, was the most efficient learning 
condition with a score of 0.68. All other conditions required about 
double the time per correct response. 

An efficiency score for subjects in each grade level was also com“ 
puted and recorded in Table 5.16, 

Although the differences in efficiency scores by grade level were 
slight, a trend of efficiency related to age was apparent. Sixth grade 
subjects were most efficient in learning the task (ES = 1.11), followed 
by fifth graders (ES = 1.25) and the least efficient were fourth graders 
(ES = 1,38). 
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Condi t i ons 




Fig. 5.08 Mean times demonstrating the double interaction 
tasks X conditions from time record data in Table 5.12. 
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TABLE 5.15 

LEARNING EFFICIENCY SCORES FOR SUBJECTS IN 
CONDITIONS I, II, III AND IV 



Condition 


Total Time 


Correct Responses 


Learning Tii 
Per Correct 
Response 


1 (Teacher Role Only) 


1959.3 


1217 


1 .61 


1 1 (Pupi 1 Role Only) 


1956.0 


1493 


1 .31 


III (Teacher & Pupil Role) 


1975.7 


1360 


1 .45 


IV (Self-Instruction) 


954.0 


1409 


0.68 
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TABLE 5 .16 

LEARNING EFFICIENCY SCORES FOR SUBJECTS 
IN GRADES FOUR, FIVE AND SIX 



Grade 


Total Time 


Correct Responses 


Learning Time Per 
Correct Response 


4 


2526.3 


1833 


1 .38 


5 


2318,4 


1854 


1.25 


6 


2000.3 


1794 


1.11 
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EXPERIMENTER OBSERVATION OF THE SUBJECT BEHAVIOR 
DURING LEARNING OF THE TASK 

In addition to the data collected by formal instruments such as 
test scores and timing scores, the experimenter made a written record 
of the behavior of subjects which varied from the necessary procedural 
behavior required to learn the task. 

During the first two weeks of the study, the experimenter made 
extensive notes of the behavior beyond the expected procedural behavior. 
At the end of that time, the experimenter classified the notes taken 
and gave categorical titles to the behaviors that appeared two or more 
times in his notes. The behaviors were defined by the verbal or motor 
activities which were observed. 

In Conditions I, II and III, where pupils were paired, much more 
overt behavior was noted than in Condition IV where pupils worked on 
the task alone. The only notable factor recorded concerning Condition 
IV was the factor of competition between subjects on speed in completing 
the task first, and this occurred with only six of the forty-eight sub- 
jects in that condition. 

Table 5.17 was constructed to show these recurring behavioral inci- 
dents and the number of times these incidents were observed and noted. 
Because of the number of subjects involved, no breakdown was made of 
behavior by condition, but because all subjects in a given set were 
from the same grade level a breakdown by grade level was possible. 

The results of these tabulated forms of behavior were analyzed 
to give some indication of the types of behavior which were most pre- 
valent at a given grade level. 
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TABLE 5.17 



if 



BEHAVIOR OTHER THAN EXPECTED PROCEDURAL BEHAVIOR OBSERVED 

AND RECORDED BY THE EXPERIMENTER 















7o of 






Grade Level 


Total 


Grand 


1 tern 


No. Observed Behavior 


4th 


5th 


6th 


per i tern 


Total 


1 . 


Competition - based on time to 
complete the task (speeding-up 














of pace. Comments such as 
"Hurry up'. We're ahead." etc.) 


44 


35 


17 


96 


16.1 


2. 


Verbal interactions other than 
the task within or between pairs 
(1 miss that everytime." "Okay" 


• 












"Don't talk so loud." "1 thought 












that was it." "How many more?" 
"That was easy." etc). 


25 


27 


23 


75 


12.6 


3. 


Questions asked by subjects 
about procedure before beginning 








^ 


12.4 




the task. 


43 


19 


12 


74 


4. 


Distraction peer pressure. 
(Looking at peer's task to see 
how they compared in amount 
completed) 


15 


27 


22 


64 


10.7 


5. 


Anxiety over incorrect responses 
(Shifting position, frowning, 
striking table with hands, tapp- 


• 












i ng with feet or hands, groaning) 16 


19 


22 


57 


9.5 



6. Laughter, comment or anxiety 

about pronunciation. 11 12 

7. Pupil-subject correction of 
teacher mistake (Pronunciation, 
procedure, checking, e.g. "Did 

you say baum?") . 25 6 

8. Subject concern with experimenter 
observation. (Looking directly 
at the experimenter to see if 

they are being observed). 10 16 



20 



11 



43 



42 



7.2 



7.0 



12 



38 



6.4 
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TABLE 5.17 (Cont.) 



Item No. Observed Behavior 



% of 

Total Grand 

4th 5th 6th per item Total 



9. Subject-teacher making correc- 
tions of pupil work or giving 
directions to pupil. 10 11 I3 

10. Pup! 1 -subject giving response 
without oral verbal cue from 

the teacher . 2144 



11. Obvious and overt lack of atten- 
tion to the task-card at the 
time of the reinforcement state- 
ment by the teacher. (Looking 
away from the task completely 

at the time of feedback.) 15 

12. Pupil -subjects (Cont. II) who 

want to have a turn at being 
teacher and ask experimenter if 
they could be the teacher. 5 

13. Subjects asking to go through 
the task an additional time 
even though instructed expli- 
citly on the number of times 

to do the task. 4 

14. Subject asking for experimen- 
ter's help during learning 

task. 3 



5 1 



0 2 



1 2 



2 0 



15. Subject serving as teacher in 
a pair not taking charge and 
getting procedure started. 

(Requiring prompting from 

experimenter.) 3 2 0 



34 



29 



21 



7 



5.7 



4.9 



3.5 



1 .2 



1 .2 



.8 



.8 



t 
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Grand Totals 



250 186 161 



597 



100 % 



GeneraHy, the results indicated much more variant behavior at 
the fourth grade level than at the other grade levels. A total of 250 
incidents were recorded at the fourth grade level, and while this was 
a small number compared to the procedural interactions which took place 
between all subjects at a given grade level (15,360), the number for 
fourth graders was sixty-four more incidents than those observed in the 
same numbers of fifth grade subjects and eighty-nine more incidents 
than those observed in s^th grade subjects, The fourth grade sub- 
jects had 41.8 percent of the total incidents recorded for the three grades. 

The results also showed patterns of behavior related to grade 
level. Fourth graders generally were interested, not only in following 
procedure, but also in following the progress of their fellow fourth- 
graders during the task. Fourth grade pupils were observed showing 
more incidents of behavior, other than expected behavior, in the areas 
of (1) competition; (2) asking questions about procedure; (3) pupil 
correcting the teacher’s errors; (4) giving a response without waiting 
for verbal cue; and (5) inattention at the time of reinforcement. 

Fourth grade subjects were also higher in incidents of observed variant 
behavior than fifth and sixth grade subjects in items 12, 13 , 14, and 
15 of Table 5.17, but the number of incidents form a very small percent 
of the:total number of incidents. Fourth grade subjects observed behavior 
generally followed a pattern of non-conformity with several types of 
variant behavior recorded. 

Fifth grade subjects recorded more incidents of behavior, other 
than procedural, than the fourth or sixth grade in the areas of, (1) 
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verbal interactions other than the task, (2) distraction by peers, 
and (3) concern with experimenter observation, but in all three cases 
these were only slightly higher. Fifth graders' observed behavior 
generally established a pattern of concern over what others in the 
room were doing. 

Sixth grade subjects showed higher incidents of overt behavior 
in the areas of, (1) laughter or anxiety about pronunciation, (2) anxiety 
over incorrect responses, and (3) teacher correction of pupil work. 

These were also only slightly higher than those recorded for the other 
two grades. A pattern of conformity to expectation and anxiety about 
deviation from the expected was observed in sixth grade subjects. 

Sixth grade subjects indicated little if any secondary need for 
procedural training. Results of recorded behavior in items 3» 11 » 12, 

13, 1^ and 15 which all dealt with behavior related indirectly to pro- 
cedure showed very low totals for sixth grade subjects. Sixth graders 
followed procedures with little error. 

Total results per item of observed behavior show that subjects 
in this experiment were producing behavior other than that expected 
most often in the three following areas: 

(1) Competition (item 1) ninety-six incidents. 

(2) Verbal interactions other than the task (item 2) seventy- 
five incidents. 

(3) Questions about procedure (item 3) seventy-four incidents. 

Table 5.17 was organized to show the total number of incidents per 

item in rank order, and the three items above accounted for forty-one per 
cent of the recorded incidents of overt behavior for all three grade level 
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General Observat i ons 

The experimenter recorded many observations of a general nature 
which could not be classified in a specific category. These observa- 
tions were listed here to furnish a record of these statements and to 
provide data of all observations, not only those which were adaptable 
to specific classification. 

The following were the general observations, listed chronologically 
as these statements appeared in the experimenter's records. 

1. The volume levels of the voices of each group were adjusted 
to fit the noise level of that group. 

2. In Condition III, one pair realizing that the cards would go 
to the other subject after going through the stack, worked o»*t a system 
wherein they placed the cards in his hands so the cards would be ready. 

3. Students attended to task much more readily on second and 
third days. Procedure went better--1ess talk — less reluctance to begin. 

4. More uniformity of task behavior within Condition V than 
any other. Less behavioral variation between Monday and Tuesday for 
Condition IV, self-instruction, than for any other. 

5. Teac^^er-subject smiling at correct response seemed to serve 
as a reinforcer. 

6. Procedures were much more fluent during second day learning 
trials. Pace remained more constant. 

7. Anticipation of the second day's task indicated by pleasure 
with which student accepts the new set of cards. 

8. Procedural rapport with various groups seems to affect the 
work attitude of the particular group of eight. The groups who asked 
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questions about procedures were more diligent in their study efforts 
during trials. Answers to questions seem to add a new “set" toward 
group study. 

9. Third day trials were more efficient in the time and effort 
because of habituation of procedures. Does this affect learning? With 
a lessening of anxiety about procedure does learning become more effec- 
tive or less effective? 

10. One pair in Condition I and II followed a pattern of extreme 
concentrated study concentrating on each pronunciation and response. 
Teacher often stared at wall while making up his mind about pronuncia- 
tion. Teacher's lack of action eventually caused pupil to reprimand 
him. Pupil wanted more rapid trials. 

11. There may be some additional learning by those who finish 
first and attend closely to other pairs working in the room; however, 
this attention was very seldom evident as pupils who finished first 
usually talked quietly together. 

15. One pair continued to keep track of errors on trials, numeri- 
cally counting them. 

16. Procedures seem to bother students with low reading scores. 
They spent much time trying to remember how they were to do it and 
consequently slow processes resulted. 

17. Condition III, teacher and pupil, seems to present the best 
teacher-pupil relationship because both know they are going to be in an 
equal status position. 

18. Very little interaction of any kind was observed in Condition 
IV. Some glancing at one another to see how many cards were left in 



each other's stack. 
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19. Regard for other pairs was evident in a subdued use of voice. 

20. Near the end of the task some pupils became lackadaisical 
about their efforts in Condition 1 and II. Task may be boring at this 
point for some^or too long for slow pupils. Actions indicating boredom: 
not attending tc card or response, slouching down in chair, looking at 
ceiling, avoiding task until prompted by partner. 

21. Condition III seems to provide incentive for both subjects 
that is not evident in I and II. Interest remains high throughout task. 

No bored individuals or pairs observed. 

22. There is no question that Condition IV, self-instruction, 
is an efficient procedure as time goes but there is some concern with 
the need for reinforcement by another person of the completed task, 
especially on second and third day of learning task. 

23 . Social prestige is probably an important factor when the 
subject gets to play the teacher's role. 

24. Almost all sub-vocal responses in Condition IV are accompanied 
by lip movement and in some cases the responses are whispered. 

25. Teacher-subjects at the fifth grade level appeared adept at 
keeping pupil on the task. 

26 . Enthusiasm of those in Condition III, teacher and pupil role, 
is remarkable. While subjects in other conditions show enthusiasm, these 
subjects can hardly wait to get on with the task. 

27 . Very few unusual occurrences were discovered at the sixth 
grade level. Maturity makes it possible for these people to do this 
task with great attention. 

28 . The pace was almost identical for all four pairs within each set. 
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of eight subjects In Condition I, teacher role only, and II, pupil 
role only, at sixth grade level. Practically complete attention to the 
task was observed throughout. Pairs finished within thirty seconds of 
one another. 

29 . The seeming maturity of sixth grade subjects in contrast to 
fourth and fifth grade subjects was noted. 

A COMPARISON OF COMPARABLE LEARNING CONDITIONS IN THE 
VAN WAGENEN STUDY OF 1962 AND THIS STUDY 

The task used in this study was developed and first used in 1962 
in a doctoral study by R. Keith Van Wagenen at the University of Utah. 
His study is reported as Chapter II of this report. The same Immediate 
Retention Test was used in both studies. 

Van Wagenen also established four learning conditions in that study 
which were: 

Condition 1. Direct oral responding to a teacher-experimenter 
on every fourth item in the task. The subjects in this treatment were 
reinforced directly by the teacher on one-fourth of the items in the 
task and had a vicarious listening experience on the other three-fourths 
of the Items. 

Condition 2. Vicarious experience on all items. There were four 
other subjects in a set of eight, including the four in Condition 1, 
who never responded directly to the teacher-experimenter but only read 
and listened to the response made by subjects in Condition 1. 

Condition 3 . Was identical to Condition IV in the present study 
except that a teaching machine was used to present the task in the Van 
Wagenen study. 
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Condition 4. Was the same as Condition 3 above except that in 
addition to the feedback by the machine the teacher-experimenter also 
gave verbal reinforcement (no comparable condition in the present study). 

Table 5.18 shows the comparisons that can be made between the data 
derived from the Van Wagenen study reported in Chapter II and the data 
from the present study. 

Table 5.19 was constructed to show the means and standard deviations 
for each comparable learning condition with data for each study from 
correct responses on the Immediate Test. 

The means for the comparable learning conditions were nearly 
identical for the subjects completing the tasks by use of flashcards or 
teaching machines (Comparison No. 1). Differences between the two means 
were very slight, but to test the significance of the difference, the 
means were treated with a t test. The results of this test (t = .59 
with 84 degrees of freedom, t equal to 2.00 needed at the .05 level) 
showed no significant difference between the means of these two groups. 

The second comparison which was feasible between the two studies 
involved subjects who at no time received verbal feedback or reinforce- 
ment. These means show in Comparison No. 2 in Table 5.19 were also 
tested by use of a t test and again no significant differences were 
noted (t = 1.73 with 84 degrees of freedom, t equal to 2.00 needed at 
the .05 level). The difference which neared significance in this com- 
parison favored the vicarious subjects of the Van Wagenen study who heard 
the reinforcement given to others. The subjects in the present study 
did not hear it given to other subjects by another person, but gave the 
reinforcement to other subjects and thereby heard their own voices giving 
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TABLE 5.18 

COMPARABLE REINFORCEMENT CONDITIONS FROM THE VAN WAGENEN 
STUDY (CHAPTER II ) AND FROM THE PRESENT STUDY 



Van Wagenen 
Study-Chapter 1 1 


Myers Study 
This Chapter 


Condition III - Self-instruction 


Condition IV - Self-instruction 


Condition II - Pupil observed but 


Condition 1 - Pupils in teacher’s 


did not respond 


role observed but did not respond. 


Condition 1 - That portion of this 


Condition II - Pupils reinforced 


condition in which pupils were 


100 per cent by teacher pupil. 


reinforced 100 per cent by teacher 
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TABLE 5.19 

MEANS AND STANDARD DEVIATIONS FOR COMPARABLE 
CONDITIONS IN THE VAN WAGENEN STUDY 
AND THIS STUDY 



Comparison Description of Comparable 
No. Learning Condition 


Van Wagenen 


Myers 








X 


SD 


X 


SD 


1 


Subjects received task and 
feedback from machine or 
flashcard . 


29.22 


8.59 


29.35 


6.39 


2 


Subjects who did not receive 
feedback directly (vicarious 
or pupi 1 as teacher) . 


26.30 


5.87 


25.35 


7.17 


3. 


Subjects who received 100 
per cent direct verbal 
rei nfor cement . 


32.70 


12.23* 


31 .10 


6.34 



*Only one-fourth of the items were directly reinforced in the Van 
Wagenen study and this mean and standard deviation represent only the 
scores on those items multiplied by 4. 
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the feedback. 

Comparison No. 3 was made to show the differences between subjects 
who received direct verbal reinforcement from a teacher-experimenter 
and those who received direct verbal reinforcement from another pupil. 

The results of this comparison were indicated in a t test (t = 2.39). 

The resulting t was significant at the .05 level and favored the subjects 
in the Van Wagenen study who were reinforced by an adult experimenter. 
This comparison was approached cautiously because in this study all items 
were reinforced and in the Van Wagenen study the subjects received rein- 
forcement directly on every fourth item and only these fifteen items 
were used in the comparison multiplied by four to make the means roughly 
comparable . 



Summary of Fi nd i ngs 

This study was completed in order to investigate learning condi- 
t ionsi nvolvi ng pupil-pairs working together on a verbal learning task 
and pupils working alone on the same task. Four learning conditions 
were established; three involving pupil pairs and one with pupils working 
alone. Subjects were 192 elementary school pupils equally divided 
between fourth, fifth and sixth grade levels. Subjects were also equally 
divided across learning conditions. Subjects were paired by reading 
score and sex. Sets of subjects for each learning condition were equated 
by reading scores. 

The task consisted of sixty German stimulus nouns each with two 
English responses. The task was to learn the German noun and the one 
English response which had the same meaning as the German word. The 
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task was divided into three equal parts and one part was given on! each of 
the days, Monday, Tuesday and Wednesday with a recognition test given 
on Friday for all sixty items. A delayed test of retention of the task 
was also given to all subjects. 

, Time data was collected on all subjects by recording the time 
each subject or pair of subjects required to complete each day's task. 
Observations of behavior other than procedural behavior was recorded 
by the experimenter. 

Findings in the study were examined across learning conditions, 
grade level, and comparative conditions with a previous study. The 
findings were as follows; 

Condition I and II (a pupil pair with one subject [\2 >n the 
teacher role and the other subject in the pupil role U\J ). Subjects 
who served as teachers in the pupil pair learned significantly less 
than subjects in all other conditions. These subjects also had the 
poorest efficiency scores when time to learn the task was considered. 
Subjects in Condition II, the pupil role, had significantly superior 
learning to all other conditions on the immediate test but learning 
was superior only to Condition I subjects on the delayed test. Effic- 
iency scores for subjects in Condition II were second only to subjects 
who worked alone. 

Condi t ion 1 1 I (pupil pa i rs where subjects switched teacher-pupil 
roles at mid-point in each day's task). Subjects in this learning 
condition learned significantly more than those in Condition I and 
showed the least percentage of loss from the immediate test to the 
delayed test. Poor efficiency scores were recorded. Change of role 
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appeared to make this condition more motivating to subjects than other 
cond i t i ons . 

Condition IV (pupil working alone with self-instruction materials). 
These subjects were second only to subjects in Condition II on total 
learning scores. Subjects in this condition learned the first day’s 
task better than any other subjects, but showed next to the lowest 
learning on the third day's task. This decrement was in part respon- 
sible for one double interaction in the analysis of variance being 
significant. Subjects in this condition had the best efficiency scores 
requiring less than half the time per item learned than two other 
condi tions . 

When learning scores were compared across grade 'evels no signifi- 
cant differences were found; however^ some differences attributable to 
grade level were evident. 

Grade Four . Subjects from this grade level had higher scores in 
pupil-pairs than when working alone on the task. Efficiency scores for 
this grade level were poorest of the three levels. Subjects in this 
grade were the most likely to show behavior other than procedural; these 
subjects accounted for forty-one per> cent of such behavior recorded 
by the experimenter. 

Grade Five . These subjects appeared to be the norm of the three 
levels. Fifth graders showed good learning in all conditions and had 
efficiency scores which were better than fourth graders, but not as 
favorable as sixth graders. 

Grade Six . Subjects at this level showed excellent scores when 
working on the task alone, and were less effective when working in pairs. 
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Efficiency scores were superior to other grade levels. Observed behavior 
indicated these subjects had a great desire to conform to peer and experi- 
menter expectations. 

Some comparisons made with the Van Wagenen (Chapter II) study, which 
used the same task, indicated that subjects can learn this type of task 
as well by working in pupil-pairs or alone as subjects interacting in 
small groups with an adult. 

Impl i cat i ons : 

1. Pupils who received verbal reinforcement feedback learned more 
than those who gave it. Pupil-pairs in a classroom should be organized 

in such a way that task materials allow them to reinforce one another. 

2. Materials for self-instruction in tasks of this type produced 
efficient learning, but showed marked decrements after the initial 
learning. This decrement might be reduced by a redistribution of learn- 
ing trial time. Materials should probably be constructed to give variety 
to this experience. A teaching machine produced no better learning than 
that acquired by use of a simple self-instruction card. Efficiency in 
learning was superior with the self-instruction mode; it required sub- 
jects in this mode about half the time to learn an item than was required 
by pupi 1-pai rs. 

3. Grade levels made little difference in total learning on this 
task, but lower grade level subjects learned more working in pupil-pairs 
while higher grade level subjects learned more using a self-instruction mode. 

4. Elementary pupils learned this task nearly as well by working in 
pairs or alone as they did by interacting with an adult teacher-experimenter. 
Teachers might make more effective use of their time by organizing their 
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classrooms so that the rote memory tasks are taught by one pupi 1 to 
another or by a self-instruction device 

5* Subjects who reverse teacher-pupil roles when working in pairs 
showed about equal learning by both members of the pair. Some implica- 
tions for classroom organization are evident here, but additional data 
should be sought which would give information on learning when switching 
takes place at an increased rate; perhaps on every other item. 

6. Spacing of learning trials or a different time distribution 
of these trials might reduce the learning decrement across days of the 
task. Additional experimentation should be conducted in this area. 
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In the previous study, the pupils working in pairs were matched 
with respect to their scores on a reading test. The research workers 
were inclined to believe that the amount of learning taking place would 
be influenced by the extent to which the pupils in a pair were or were 
not of comparable achievement, but limitations had to be placed on the 
size of the study and only pairs with matched pupils were included. A 
second study was planned in which pupils mismatched with respect to 
achievement were included as well as pairs in which the pupils were 
matched. While one can list a number of ways in which the effects of 
mismatching or matching influence the learning of the pupil pairs, 
it is difficult to make a predicition concerning the way in which the 
over-all effect wi 1 1 operate. In the mismatched pairs, the high achiever 
functioning in the teacher role might be expected to urge on his low 
achieving pupil, but there is also the possibility that he might become 
impatient and irritated with him. When the low achiever is functioning 
as a teacher he might well pace the task at too low a rate for the 
efficient learning of his high achieving pupil. ^ Other problems might 
well be produced by the fact that high achievers do not typically choose 
to work with low achievers when the choice is up to them and this mis- 
matching might well provide uncongenial work partners with a resulting 
deterioration in learning. Rather than stating hypotheses, the purpose 
of this study was to provide an over-all estimate of the total effect of 
these various factors on learning in situations in which pupils teach 
pupils and in which the pupils were mismatched with respect to academic 
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The previous study also raised a number of questions concerning the 
capacity of pupils to work alone at different age levels. The data 
suggests that the 4th grade pupils worked best in pairs, but by the 
sixth grade the children working alone were as effective as those working 
together. An interesting question is whether a similar trend continues 
through the junior high school level with the older pupil becoming better 
at working alone than in pairs. 

Thus, the study reported in this chapter is a continuation of the 
study reported in Chapter V in that it reproduced some of the conditions 
of the latter study but at the 7th and 8th grade levels. In addition, 
the study also explores further the effect of matching and mismatching 
pupil pairs in terms of academic achievement on the performance of the 
pupils in a learning situation requiring the cooperation of each member 
of a pair. 



PROCEDURE 

Learn i nq Cond i t ions 

Condition I. Pupils matched on the basis of grade point average 
undertook work in pairs with one pupil functioning as the teacher and the 
other as the learner. This condition refers to the pupil teacher. 

Cond it ion II . This is the same as Condition I except that it 
refers to the pupil learner rather than the pupil teacher. 

Condition III . Pupils worked in pairs as in the previous two 
conditions but they were mismatched with respect to grade point average. 
This condition refers to the teacher in each pair. 
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Condition |V . Pupils worked in pairs as under the previous condi- 
tions but the pairs were mismatched. This condition refers to the pupil 
member of the pair. 

Condition V. The pupils worked alone using the cards as a self- 
instructional device. 

Sub iects 

The subjects for this study were derived from English classes in 
a junior high school in Salt Lake City. Since the policy of the school 
is to separate pupils for the purposes of English instruction into high 
level classes and low level classes, pupils had to be drawn from both 
of these sets of classes. Eight pupils were drawn at a time and taken 
to a room where the experiment was conducted. The grade-point average 
for each child was computed for his or her work covering the three previous 
semesters, but only grades from academic solids were included in the 
computation. Pupils from the 7th and 8th grades were included in the 
study, but the two grade levels were kept separate in all experimental 
treatments. A total of 160 subjects were included in the study, 80 
at each grade level and 32 to each of the five learning conditions. 

The mean grade-point average for subjects in each grade and each 
condtion are shown in Table 6.01. The mean difference in grade-point 
average for the matched pupils was 0. l6 and the corresponding difference 
for the mismatched pupils was 1.21. 

Task 

The task was identical with that used in the previous studies in 
this series with the pupil pairs working with small cards with thequestion 
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TABLE 6.01 

MEAN GRADE POINT AVERAGES FOR SUBJECTS BY 
GRADEp SEX AND CONDITION 





7th Grade 




8th 


Grade 




Condition 


Girls 


Boys 


Total 


Girls 


Boys 


Total 


Condition 1 
Matched Teacher 


2.6 


2.2 


2.4 


2.7 


2.1 


2.4 


Condition 1 1 
Matched Pupi 1 


2.4 


2.2 


2.3 


2.5 


1.9 


2.2 


Condi t ion III 
Mismatched Teachers 


3.1 


2.1 


2.6 


2.6 


2.4 


2.5 


Condition IV 
Mismatched Pup i 1 s 


2.6 


2.0 


2.3 


2.9 


2.2 


2.5 


Condition V 


2.1 


1.6 


1.8 


1.9 


1.7 


1.8 
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on one side and the answer on the other. Learning took place on three 
successive days , -Monday , Tuesday, and Wednesday - and the children were 
tested on Friday. 

The data fitted a 2 x 3 x 5 repeated measures design. One hundred 
and sixty seventh and eighth grade students undertook a learning task on 
three successive days and were assigned to one of five experimental 
conditions. Ss worked in teacher-pupil pairs in the first four condi- 
tions. In condition five, Ss worked alone with the task. 

The task consisted of learning 60 German words and their English 
equivalents. These words were learned in groups of 20 on Monday, 

Tuesday and Wednesday of each week. Data were gathered from a multiple- 
choice test given each pair across the five conditions to complete the 
task. 

Administration of Task 

The task was administered as in the previous study. One of the 
experimenters kept a record of the time required for each pair of pupils 
or for each pupil for the pupils working alone. 

RESULTS 



The results will be reported first in terms of the scores on the 
achievement test, and then in terms of the time data. 
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Retention Data 

Data on individual scores from the retention test was initially 
examined by computing means and standard deviations for each condition, 
grade and task. These values are presented i n Table 6.02. A brief 
examination of this data shows that the means for Task A are uniformly 
higher than those for Tasks B and C. In addition, Means for Task B 
are consistently higher than those for Task C. It is not clear from 
this table whether there are any uniform differences as a function of 
grade level. 

In order to examine the differences in amount learned as a function 
of condition. Table 6.03 was constructed. Means for all conditions were 
higher for Task A than for Task B and higher for Task B than for Task C. 

With the exception of identical means (8.5) for Condition I and 
II on Task B, the pupils from matched pairs and pupils from mismatched 
pairs did better than their corresponding teacher partners across all 
tasks (days) . 

Tables 6.02 and 6.03 showed differences in amounts learned on each 
task (day) and also differences in amounts learned on each task as a 
function of condition. In order to examine the significance of these 
differences the analysis of variance shown in Table 6.04 was computed 
with three main effects: tasks, conditions, and grade levels. 

Main Effects 

The main effects which were significant were conditions (p<. 05) 
and tasks (p<.001). Differences due to grade level were not signifi- 
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TABLE 6.02 



MEANS AND STANDARD DEVIATIONS OF RETENTION SCORES FOR GRADE LEVEL, 
CONDITIONS AND TASKS— CUMULATI VE GRADE POINT AVERAGES FOR 
CONDITIONS INCLUDED (N=l6 CASES FOR 
EACH GRADE BY CONDITION) 









A 






B 




C 


Total 




Condi t i on 


Grade 


CGPA 


X 


Z 


X 


Z 


X 


Z 


X 


Z 


1 


*7 


2,4 


8.9 


2.89 


8.5 


3.63 


8,0 


3.33 


25.4 


3. 12 




8 


2,4 


10,3 


2.86 


8.5 


3.84 


6,8 


2.27 


25.6 


3.22 


1 1 


7 


2.3 


11.7 


3.71 


8.3 


3.29 


7.8 


2,69 


27.8 


3.76 




8 


2.2 


11.7 


3.03 


8,7 


3.35 


8.3 


3.09 


28.7 


3.24 


III 


7 


2.6 


9.6 


3.76 


8.1 


2.74 


7.0 


4.24 


24.7 


3.08 




8 


2.5 


11.3 


4,41 


9.5 


4,87 


7.8 


2.93 


28,6 


3. 10 


IV 


7 


2.3 


12.9 


3.67 


10,0 


3.61 


8.4 


2.90 


31.3 


3.41 




8 


2.5 


12. 1 


3.65 


9.8 


3.06 


9.5 


3.41 


31.4 


3.49 


V 


7 


1.8 


12.0 


3.54 


9.0 


2.32 


6.9 


3.23 


27.9 


3.71 


8 

Mean and Standard 


1.9 


11.0 


3.57 


8,8 


3.05 


6.9 


2,72 


26.7 


3.52 


Deviations 


for Tasks 


11.1 


3.26 


8.9 


2.50 


7.7 


2.98 


27.7 
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TABLE 6.03 

MEANS AND STANDARD DEVIATIONS OF RETENTION DATA 
FOR TASK BY CONDITION WITHOUT REGARD 
TO GRADE LEVEL 





A 






B 




C 


Total 




Condi ti on 


X 


Z 


X 


Z 


X 


Z 


X 


Z 




















1 


9.6 


2.72 


8.5 


3.59 


7.4 


2.76 


25.5 


3.15 


1 1 


11.7 


3.19 


8. 5 


2.97 


8.0 


2.78 


28.2 


3.33 


1 1 1 


10.8 


2.83 


9.1 


2.85 


7.4 


2.98 


27.3 


3.17 


IV 


12.5 


3.^9 


9.9 


3.13 


8.9 


3.15 


31.3 


3.56 


V 


11.6 


3.53 


8.9 


2.66 


6.9 


2.93 


27.4 


3.60 


Means and 
Standard 
Devi at ions 
by Task 


11.2 


3.26 


8.9 


2.50 


7.7 


2.50 
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TABLE 6.04 

ANALYSIS OF VARIANCE OF RETENTION TEST SCORES FOR 
TASKSp GRADES AND CONDITIONS 





Ss 


df 


Ms 


F 


p 


Between Ss 


2741.50 


159 








Grades 


11.72 


1 


11.72 


.71 


• 

CO 

• 


Condi t i ons 


197.95 


4 


49.49 


3.00 


.05 


Grades x Conditions 


56.64 


4 


14.16 


vO 

00 

0 


N.S. 


Residual Between Ss 


2475.19 


150 


16.50 






Within Ss 


1638.00 


320 








Tasks 


1013.68 


2 


506.84 


312.86 


.001 


Tasks X Grades 


.71 


2 


.36 


.22 


• 

CO 

0 


Tasks X Conditions 


80.96 


8 


10.12 


6,25 


0 

0 

0 


Tasks X Grades x Conditions 


55.53 


8 


6.94 


4.28 


0 

0 

0 


Within Residual 


487.12 


300 


1.62 







Total 



4379.50 



479 
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As Table 6.05 indicates, all Ss in Condition II (Ss acting as 
pupils) learned significantly more than did their corresponding teacher 
partners (Condition I). It is also apparent that the pupils from mis- 
matched pairs in Condition IV learned a significantly greater amount 
than did Ss in any other condition. 

Interactions 

The analysis of variance of the retention data yielded a double 
and a triple interaction which will be examined in that order. 

The double interaction was between Tasks and Conditions and was 
significant at the .001 level. Figure I represents this interaction. 

It can be seen that the interaction was largely due to the sharp 
decrement across days in Condition III, teachers f rom mi smatched pairs, 
and Condition V, pupils working alone with the task. 

The triple interaction in Table 6.04, Task x Grade x Condition was 
significant at the .001 level. Because it is generally agreed that the 
source of variation in a triple interaction is difficult, if not 
impossible to represent, no analysis of this interaction was attempted. 

Time Data 

The time data was obtained by recording the length of time taken 
by each pair or individual within a condition to complete each day's 
task. (Example: mean number of minutes for a pair on a particular 

day's task was 10 3/4 minutes). 

Table 6.06 presents the means and standard deviations for amount 
of time taken per task, condition and grade level. It should be noted 
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TABLE 6.05 

DUNCAN“S MULTIPLE RANGE TEST FOR SIGNIFICANCE OF DIFFERENCE 
IN AMOUNT LEARNED BY EACH CONDITION 



Cond i t i on 


1 


1 1 1 


V 


1 1 


IV 


1 


— 


N.S, 


N,S. 


. 05 - 




1 1 1 




— 


N.S, 


N.S, 


.01 


V 






— 


N.S. 


.01 


1 1 








— 


.01 


IV 













* The .05 refers to the level of significance of the difference between 
Condition II and I (p<. 05, Condition I < Condition II), 

** The .01 refers to the level of significance of the difference between 
Conditions I, III, V and II (p<.01, Condition I, Ml, V and IK 
Condi t ion IV), 

N,S,=Not significant 









mmmmiimmmm 







6.12 
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B 



C 



Fig.^6‘. 01 Mean retention scores demonstrating double 
interaction of tasks x conditions for retention data. 












6.13 n 

TABLE 6.06 

MEANS AND STANDARD DEVIATIONS OF TIME DATA FOR 
GRADE LEVEL, CONDITION AND TASK 









A 


B 






C 


Total 




Condi t ion 


Grade 


X 


Z 


X 


z 


X 


Z 


X 


Z 


1 


7 


10.70 


1.92 


10.73 


1.88 


9.67 


2.57 


31.10 


2, 16 




8 


11.70 


3.08 


11.44 


2.54 


10, 16 


2. 10 


33.30 


2.64 


1 1 


7 


10.70 


1.92 


10,73 


1,88 


9.67 


2.57 


31.10 


2.16 




8 


11.70 


3.08 


11,44 


2,54 


10, 16 


2. 10 


33.30 


2,64 


Ml 


7 


12.09 


1.66 


10.82 


1.74 


9,61 


1.92 


32.52 


2.01 




8 


13.46 


1.89 


12.53 


1.64 


11.89 


2. 16 


37.88 


1.98 


IV 


7 


12.09 


1.66 


10.82 


1.74 


9.61 


1.92 


32.52 


2.01 




8 


13.46 


1.89 


12.53 


1.64 


11.89 


2.16 


37.88 


1.98 


V 


7 


6,77 


1.68 


6.75 


2.26 


5.77 


2.55 


19.29 


2.20 




8 


6.46 


2.68 


5.36 


2,16 


4.76 


1.72 


16,58 


2.28 


Means and 
Standard 
Dev i at i ons 
for Tasks 




10.91 


3ol7 


10.31 


2.99 


9.31 


3.07 
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on this and the following representations of the time data that the 
figures presented indicate the amount of time in minutes taken to 
complete one day’s task under a particular condition, (Example: 
seventh grade, Condition I, Task A, time taken was 10.70 minutes). 

It should further be noted that the time data for Conditions I and II 
are identical as are those for Conditions III and IV, This is inevitable 
since Condition I and II represent a matched pupil-teacher pair and 
both teacher and pupil have to take the same amount of time to complete 
the task, and Conditions III and IV represent a mismatched pupil- 
teacher pair and, again, both members of each pair take the same amount 
of time. 

A brief examination of this data shows that subjects across all 
five conditions took uniformly more time on Task A (first day) than they 
did on Task B or C (second and third day). They also took more time on 
Task B than on Task C. 

An examination of time differences by condition shows that 
Conditions III and IV (mismatched pupil-teacher pairs) took a uniformly 
greater amount of time across all tasks than did Ss in Condition I and 
II. Condition V (pupils working alone with the task) took less time 
across all tasks than did any of the other Ss, 

The table also shows that in the paired conditions the eighth grade 
Ss consistently took a longer period of time across all tasks than did 
the seventh grade Sc. The exception to this situation was Condition V 
in which the eighth grade Ss took a shorter time each day than the 
seventh grade Ss. 
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in order to examine the significance of the time differences in 
tasks, conditions, and grades the analysis of variance shown in Table 
6.07 was computed with the above mentions^d three main effects. 

Main Effects 

Time differences across the three tasks were found to be significant 
at the .001 level. A brief examination of Table 6.06 shows that the 
variance in the means of the five conditions can be accounted for almost 
entirely by the fact that Ss in Condition V took approximately half as 
much time to complete all three tasks as did Ss in the paired conditions. 

The differences in time taken within conditions were significant 
at the .001 level. A Duncan Multiple Range Test was computed to deter- 
mine the significance of these differences. This data is presented in 
Table 6,08. It can be seen that Ss in both paired conditions took a 
significantly longer time with the task than did Ss in Condition V who 
worked alone with the task. The data further indicated that Ss in Condi- 
tions 199 and iV (mismatched teacher-pupil pairs) took significantly 
longer on each day's task than did Ss in Conditions i and 9i (matched 
pupil pairs. 

8 nteract i ons 

One double interaction between grade and learning conditions was 
found to be significant at the .05 level. This interaction is represented 
in Figure 6.02, but it does not appear to be particularly meaningful. 

Ef f iciency of Learn i nq 

One of the purposes of this study was to replicate certain condi- 
tions which were examined in the study reported in Chapter V. In the 
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TABLE $.07 

ANALYSIS OF VARIANCE OF TIME DATA FOR 
TASKS, GRADES AND CONDITIONS 





Ss 


df 


Ms 


F 


P 


Between Ss 


3955.53 


159 








Grades 


81.09 


1 


81.09 


7.90 


.01 


Conditions 


2217.84 


4 


554 . 0 $ 


53.95 


.001 


Grades x Conditions 


116.72 


4 


29.18 


2.84 


.05 


Residual Between Ss 


1539.88 


150 


10.27 






Within Ss 


773.50 


320 








Tasks 


207.41 


2 


103. 10 


57.20 


.001 


Tasks X Grades 


1.12 


2 


.56 


.31 


• 

to 

• 


Tasks X Conditions 


18.08 


8 


2.26 


1.25 


N.S. 


Tasks X Grades x Conditions 


12.32 


8 


1.54 


.85 


• 

to 

• 


Within Residual 


534.57 


300 


1.81 






Grand Total 


4729.03 


479 
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TABLE 6.08 



DUNCAN'S MULTIPLE RANGE TEST FOR SIGNIFICANCE OF 
DIFFERENCES IN TIME TAKEN BY CONDITIONS 



Condition 


V 


1 


II 


Ill 


IV 


V 


-- 


.01* 


.01 


.01 


.01 


1 




— 


— 


.01 


.01 


II 






— 


.01 


.01 


1 1 1 








-- 


-- 


IV 










— — 



*The .01 refers to the level of significance of the difference 



between Conditions I, II, III, IV and V. 
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16.00 




I & II 



Ml & IV 



V Condition 



Fig. 6.02 
interaction of 



Mean time across days representing double 
conditions and grades for time data. 




study it was stated that a basic reason for recording the time data was 
to attempt to develop a measure of learning efficiency. This same 
rationale was also used in recording time data in the present study. A 
learning efficiency score was developed for each person in each condition 
and each grade level from the formula ES = -H- (ES = Efficiency Score, 

cr 

TT = Total Time, CR = Correct Responses on Retention Test). The fewer 
minutes required per correct response learned, the more efficient the 
learning was considered to be. 

The formula was applied to all cases in the five conditions and the 
summary data are provided in Table $. 09 . Condition V, self-instruction, 
yielded the most efficient learning with a score of O. 65 . The other 
four conditions required approximately double that time to learn a 
correct response. 

Summary data for the efficiency scores by grade level are presented 
in Table 6.10. Although the difference in efficiency score by grade 
level is slight^, eighth graders were slightly more efficient and seventh 
graders slightly less. 

Comparison of Learning Conditions i n this Study wi th 
Comparable Condi t i ons i n the Previous Study 

One of the main purposes of this study was to reproduce three of 
the conditions of the previous study (196A) but with older children and 
to see whether any marked trends over age could be identified. The three 
conditions which were duplicated for comparison were Condition I (teachers 
from the matched teacher-pupil pairs). Condition II (pupils from the 
matched teacher-pupil pairs) and Condition V (pupils working alone). The 
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TABLE 6.09 

LEARNING EFFICIENCY SCORES FOR SUBJECTS IN 
CONDITIONS I, II, III, IV AND V 



Cond 1 t i on 


Total Time 


Correct Response 


Time per 
Item 


1 


(Matched teachers) 


1030.50 


817 


1.26 


II 


(Matched pupils) 


1030.50 


907 


1.13 


III 


(Mismatched teachers) 


1126.50 


877 


1.28 


IV 


(Mismatched pupils) 


1126.50 


1006 


1.11 


V 


(Isolation) 


573.90 


882 


.65 
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TABLE 6.10 

LEARNING EFFICIENCY SCORES FOR SUBJECTS IN 
GRADES SEVEN AND EIGHT 



Grade 



Total Time 



Correct Response 



Time per 
I tern 



7 


2345.30 


2207 


.941 


8 


2542.68 


2282 


.897 



6.22 



II 



data from these conditions which was compared were the number of correct 
responses, the time data and efficiency scores. The previous study used 
fourth, fifth and sixth graders in a public elementary school. The present 
study used seventh and eighth graders in a public junior high school 

Figure 6.03 presents the mean number of correct responses for the 
three conditions by grade. Although it is difficult to describe any 
definite trend, it can be seen that subjects who acted as pupils (Condi- 
tion II) appeared to learn somewhat more across all grades than did 
subjects in any other condition. The graph also clearly indicated that 
subjects in Condition I (matched teachers) continued to learn less across 
all grades than did their pupil counterparts. The result of these two 
trends is for the gap between teachers and pupils to close with increas- 
ing age. 

Figure 6.04 presented the mean number of correct responses for the 
three conditions and the five grades studied. As is indicated, subjects 
in the fourth, fifth and sixth grades seemed to learn a slightly greater 
number of correct responses than did subjects in the seventh and eighth 
grades. The effect is undoubtedly motivational. 

The time taken to complete the task by grade level is shown in 
Figure 6.05. The time shows a decline across grades, but the decline does 
not correspond to an increase in efficiency. The data suggest that the 
pupils in the higher grades tend to dispose of the task more rapidly but 
that they could spend more time on it with a corresponding increment in 
learn i ng. 
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Mean number of correct 



responses 




Fig. 6.03 Comparison by grade and condition of mean 
number of correct responses in this study and the previous 
study. 
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Mean number of total 
correct responses 



29.50 
29.00 

28.50 
28.00 

27.50 

27.00 

26.50 




4 5.U 678 

Grades 

Fig. 6.04 Mean number of correct responses across 
Conditions I, II and V for the five grades which were 
compared. 



Mean number of 
minutes across 
1 1 condi t ions 
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rig. 6.05 Comparison of mean number of minutes 
across all tasks in this study and the .previous study. 
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Efficiency scores across the five grades are presented in Table 6, 1 1 <■ 
St can be seen that Ss in the seventh and eighth grades generally took a 
shorter time^ but the time per item learned was suprisingly constant. 

The higher grades learned fewer items to the point of correct performance 
on the test, a fact which probably reflected poor motivation in the 
experimental situation at the junior high school level. The elementary 
school pupil cooperates in experiments to a far greater extent than the 
more blase seventh or eighth grader. 

DSSCISSS3 0N 

The study raises a r^umber of questions of importance to the manage- 
ment of classroom learning. 

First, the differences in the performance of the matched and mis- 
matched pupils favors the mismatched group. This needs to be investigated 
further before any attempt is made to apply the result to educational 
practice. The expectation had been that the pairs containing matched 
pupils would provide the more effective learning situation, but the 
reverse was the case, A number of observations made by the experimenters 
give leads concerning the reasons for the differences found. The experi- 
menters observed that the mismatched groups there seemed to be more 
interaction between the pupil functioning as a pupil and the pupil 
functioning as the teacher. The high grade-point teacher tended to be 
more reprimanding when the pupil made a wrong answer than the teacher in 
the matched pairs. The data also indicate that the high level of per- 
formance of the mismatched pairs was a result of the high performance of 
the high-grade-point pupils in the pupil role. 
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TABLE 6.11 

COMPARISON OF LEARNING EFFICIENCY SCORES ACROSS THREE 
COMPARABLE CONDITIONS FOR SUBJECTS IN 
GRADES 4 , 5 , 6 , 7 AND 8 



Grade 


Total Time 


Correct Response 


Efficiency Scores 
Time per Item 


4 


1785.8 


1342 


1.33 


5 


1674.6 


1398 


1.10 


6 


1379.1 


1375 


1.00 


7 . 


1 304. 5 


1268 


1,02 


8 


1330.4 


1302 


1.02 
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Second, the data in this study also support the data previously 
presented indicating that while the individual working alone is the most 
efficient as a learner, his decrement over days is much greater than the 
decrement found under other learning conditions. One is left wondering 
what would have happened if the task had been spread over several weeks 
with learning sessions on each day. One suspects that the social condi- 
tions would then have shown themselves to be considerably more efficient 
than the condition which did not involve social interaction. The latter 
has been the experience of those who have studied programmed learning and 
teaching machines. Such studies have suggested that the first reaction 
to a teaching machine is highly effective learning, but soon the novelty 
falls off and a very marked decrement in learning occurs. Our data 
suggest that the endless novelty found in social interactions may have the 
effect of keeping the learner aroused and hence working at least at a 
mode rate 1 y ef f i c i ent 1 eve 1 . 

Third, the fact that there is no particularly marked increment in 
performance across the five grades studied, is, at first sight, surprising. 
Most tasks that have been studied do show a very marked increment. The 
lack of any increment in performance across grades with the present task 
may be a result of the fact that level of performance on the task is 
probably highly dependent upon motivational factors. The task is 
certainly within the ability of all of the pupils who were exposed to it. 
If they would set themselves to learn it. Differences in performance bet- 
ween Conditions II, IV and V al 1 of which involve a response- reinforcement 
paradigm must be essentially motivational in character. 
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Finally, the point must be made that in this study, as in the other 
studies involved in this series, those learning conditions which require 
the learner to respond and which then follow the response with some form 
of reinforcement are generally superior to those that depart from this 
learning formula. 
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CHAPTER VII 



ABILITY OF PUPILS TO USE POSITIVE AND NEGATIVE INFORMATION 
DERIVED FROM OBSERVING THE BEHAVIOR OF OTHER PUPILS 
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Study 

In the study reported here an attempt is made to determine the extent 
to which "observer pupils" obtain information specifically from the 
behavior of those "pupils" who interact and to discover some of the condi- 
tions that influence the learning of the "observer pupils." The paradigm 
followed in the study was that of the teacher presenting the pupil with a 
problem and a two-choice answer. The interacting pupil chooses one of the 
answers and is told if he is right. Whichever choice the interacting 
pupil makes, he provides an equal amount of information to the observer 
pupil. That is to say, the observer pupil can always determine the right 
answer from the behavior of the interacting pupil. However, there is 
considerable interest in determining whether the observer pupil learns 
as much when the interacting pupil is correct as when he is incorrect. 

Of particular importance in the present study is the extent to which 
a pupil learns from the errors as contrasted with the correct responses 
made by other pupils. Evidence from the studies of Hovland and Weiss 
(1953) and Bruner, Goodnow, and Austin (1956) indicates that, when an 
error or a correct response provides equal information in a concept 
learning task, subjects learn more from correct responses than from errors. 
Whether a similar effect occurs when one pupil observes the behavior of 
another is a matter to be determined. 

In the experiment reported in Chapter II, observer Ss were under the 
influence of varying efficiencies for direct subject oral responding. 

That is, they heard direct Ss respond correctly in some instances to all 
four presentations of some items, while for other items, 0, 1, 2, or 3 
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trials were observed by them to be correct. The purpose of that part of 
the investigation reported here was to determine the extent to which the 
efficiency of this oral responding in the learning trials influenced the 
retention of correct responses by observer Ss who had the experience of 
observing the varying efficiencies during the trials. To be borne in mind 
is the fact that knowledge of results was afforded after each item had 
been responded to, so that the same information was available to the 
observer Ss whether a direct S responded correctly or not. A comparison of 
the retention of particular items by observer Ss (as measured Friday) with 
the recorded 1 earn ing-t rials performance of direct Ss on those same items 
was undertaken. An example may suffice to clarify the procedure: assume 
a hypothetical direct subject X. Let us say that subject X was confronted 
on a Monday (or Tuesday, or Wednesday) with the item "Wahrheit" to which he 
gave audible responses. This item, as was the case with all other items, 
was presented four times at i**regular intervals. Assuming that subject 
X gave two correct oral responses to the item and two incorrect ones, 
one might suspect that the four observer Ss observing this responding 
efficiency would be uninfluenced by the number of correct performances 
since the correct choice was always indicated by the experimenter. On the 
other hand, if the observer Ss attended more closely to the responding of 
their peers than to the feedback of the experimenter, learning might be 
expected to be less effective for them if the direct S made four incorrect 
choices. 



Resul ts 

The learning of the observer subjects cannot be traced as it pro- 
gressed on particular items through the first three days of the week since 
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the only measure of learning obtained from them was administered on Friday, 
However, the course of learning for the direct subjects can be traced item 
by item as it occurred in the simulated classroom situation, and retention 
can also be measured on Friday. Since each item was presented to the 
same direct Ss on four different occassions during an acquisition session, 
the correctness of the response on each trial can be determined. One 
would expect an increasing probability of correct responses over the four 
trials. 

On every presentation of every item, the card showing the two-choice 
problem was clearly visible to all Ss, including those involved only as 
observers. The correct answer could always be determined by observing the 
behavior of the direct S and E. That is, the observer S could see the 
direct subject respond either correctly or incorrectly and E present 
knowledge of results. On each item he would have opportunity to observe 
the direct subject responding correctly between 0 and 4 times. The problem 
is to determine whether the learning of the observer subjects is related 
to the number of times they observe a correct response, even though each 
trial provides all the information necessary for learning. Now it is 
possible to find items that were responded to correctly by the direct Ss 
0 times out of 4 trials, 1 trial correctly, 2 trials correctly, etc. 

From the data it is then possible to determine the probability that an 
observer S would respond correctly to the item on Friday under each one of 
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these five learning conditions. These probabilities are presented in 
Figure 7.01 . (top) . 

The data show an increasing probability of observer Ss answering 
correctly depending on the number of correct responses to which the 
indirectly involved subject had been exposed. 

Figure 7.01 (bottom) shows the same data as for the upper graph but 
broken down according to whether the items were learned on Monday, Tuesday, 
or Wednesday. They show a positive slope for each day of learning and 
also the fact that there is declining retention across these three days. 

The latter effect may be interpreted either as a result of proactive 
inhibition or as a result of attention becoming increasingly erratic. 

Other explanations of the effect noted need to be investigated. First, 
there is the possibility that although the items were constructed to be 
equal in difficulty, there might still be substantial differences. If the 
latter were so, then the items which the direct subjects missed most fre- 
quently in the recitation situation would also be those which the observer 
subjects would be most likely to miss on Friday's test. In order to 

evaluate this explanation, an item analysis was made for an independent 

group of 200 who were exposed to a similar learning procedure and who took 

the test also on Friday. For this group a comparison was made of the 

performance on the test of those items that had been answered zero times 
by the direct Ss in the present study with those items that had been 
answered 4 times correctly. The probability of the first group of items 
(0 presentations correct) being ansv^red correctly on the Friday test was 
0.554, while the probability of the latter group (4 presentations correct) 
being answered correctly was 0.615. The difference between these two 
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Fig. 7.01 0 Probability of learning by observation 
at five levels of accuracy for the subject being observed. 
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probabilities is only 0.061, a difference very small compared with the 
difference found in the case of the observer Ss who had seen the one 
group of items responded to by direct Ss 0 times correctly and the other 
group of items 4 times correctly. The weight of the evidence supports 
the position that observer Ss learn more when they see another subject 
respond correctly than when they observe him respond incorrectly. 

The small differences in difficulty of the items on the Friday test 
might be accounted for in terms of slight differences in initial associa- 
tion value, although every effort had been made to eliminate this factor. 
That this factor is small is reflected by the fact that on the first 
trial the direct subjects would have chosen the correct answer with a 
probability of 0.500. In actual fact, the data showed a slight positive 
association value, with the correct answer being guessed with a proba- 
bility of 0.512. Nevertheless, there appear to be some small consistent 
differences. When such a probability was computed for each item for the 
direct Ss and a similar probability was computed for the so-called control 
group, the two sets of probabilities correlated C.27 (p-".0.05). 

Pi scussion 

Evidence has been presented to indicate support for the position that 
pupils who observe other pupils interacting with a person in the role of 
teacher learn more from the correct responses of the interacting pupil 
than from his errors. This occurs despite the fact that, in the situation 
studied errors and correct responses carry equal information. The situation 
parallels closely that found in concept-formation studies in which Ss 
performing tasks related to the acquisition of concepts obtain- more 
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information from their correct responses than from their incorrect ones. 

The implication of the findings for teaching is that the teacher 
should not let the correct response be known only by implication. Correct 
answers should be clearly stated and pupils shoCild not be expected to infer 
a correct answer from the errors made by other pupils. 

Study 2 

The previous study, which was based on data derived from another 
experiment suffered from the defect that it did not provide a means of 
controlling for differences in association value of the different items. 

The phenomena involved appeared to be of sufficient interest to merit the 
undertaking of a separate experiment which would have as its central 
purpose the exploration of the effect of hearing the right answer a given 
number of times and the wrong answer a given number of times in a learning 
situation in which both right and wrong answers provide equal amounts of 
information. In this second experiment an advantage was to be gained by 
the fact that right and wrong answers were to be given by stooges who, 
over the series of learning trials, could be cued to give the wrong answer 
either zero times, or once, or twice, or three times or four times. The 
experiment could be so arranged that the items for which the correct 
answer was given zero times would, on the average, be equal in association 
value to the items for which the correct answer was given once. The same 
applied to the items, to which a correct answer was given twicer, three times 
and four times. Observer Ss were to learn by observing the performance of 
the stooges as these responded correctly to items zero, once, twice, three 
times, or four where the proportion of items was equal under the five 
categories . 
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Method 

Materials . The tasks which were used in the previous experiment 
were used in this study, but with some rearrangement of the materials. 

Task C was given on the first day, and was followed on subsequent days by 
Task A and B. This change in the order of the tasks served another purpose 
and need not be considered here. However, an important rearrangement of 
the items within each task was made. The purpose of this rearrangement was 
to classify the items into five groups so that the average association 
values for each group was the same. The basis for this classification was 
the extent to which subjects in the experiment reported in Chapter II had 
been successful with individual items in the task. According to the 
criterion performance of these subjects, the items were ordered in des- 
cending difficulty and then assigned to the categories of 0, 1, 2, 3 and 
4 by which stooges were to respond during the learning trials. Items in 
any group were to be answered by the subjects by a given number of right 
and wrong answers over the four trials. The order of correct and incorrect 
responding was programmed so as to provide a like-random order of respond- 
ing to items under the five categories. To illustrate this, the order 
of correct responding for itei.T number 1 could have been right-wrong-right- 
wrong and for item number 2, it could have been wrong-wrong-right-wrong. 

All of the possible combinations of correct responding were employed in the 
task. Each group of items was assigned to such a response category before 
the experimental data were collected. 

Subjects . All the subjects (N=l80) came from grades 4, 5 and 6 in two 
elementary schools of Tempe, Arizona, a University community of 40,000 
inhabitants. There were six classrooms representing three grade levels at 
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each of the two schools. The four subjects from a school classroom who 
were to be those responding to the items and who were to be the stooges in 
the experiment were separated from the class and instructed that they 
were going to help in the conduct of an experiment and that they were to 
perform exactly as instructed. These pupils were selected by the teacher 
as those most likely to cooperate. They, were shown the general nature of 
the task and were told that when the experimenter's thumb appeared in 
front of the card that they were always to choose the first answer and when 
they could not see his thumb in front of the card that they were to choose 
the second answer. Each one of the four stooges was then given a practice 
trial to make sure that he understood the instructions. They were also 
told that they must not tell any of the other children in the school that 
they had been given special instructions. Classroom teachers made the 
selection of stooges (two girls and two boys) on the basis of the teacher's 
expectation of whom among her pupils could be relied upon to keep a 
secret. Care was taken to ensure that none of the subjects had any 
familiarity with German. Each group consisted of 4 stooges and approximately 
25 pupils who learned by observation. 

Task administration conditions . The administration conditions 

involved a classroom situation in which the children were seated in their 

normal desk positions. The names of the four stooges were called and 

o 

these took positions in a straight row in front of the class at a 45 angle 
to the class on the right and the experimenter on the left. The experi- 
menter also stood in front of the class with the large cards presenting 
the German words. The pupils learned the task at the same hour each day 
on Monday, Tuesday and Wednesday, and were tested in their regular classrooms 
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on Friday. Both the stooges and the other pupils were told thdt they 
would be tested sometime for their knowledge of the German words presented. 
The feedback conditions were identical with those provided in the first 
study reported in this series. When the correct answer was given the 
experimenter said "right" or "that's right." When the pupil gave the 
wrong answer the experimenter said nothing. The task was administered 
only to the stooges. The remainder of the pupils learned the task by 
observation 

Criterion task . The same test which was given in previous experiments 
was given on Friday to all those participating in the learning sessions. 

Treatment of the data . When a check was made on the difficulty 
level of the items that had been assigned to the five different learning 
conditions (observing 0 correct responses, 2 correct responses, etc), it 
was found that the groups of items did not have the equality which they 
were supposed to have. In order to provide for the equality of the groups 
of items, an essential condition of the experiment, an item was discarded 
from each group of five items, leaving four items in each learning condi- 
tion. The discarding of the items was undertaken after the data had been 
collected, but the procedure introduced no bias into the results since it 
was based on data collected in an earlier study. 

The test given on Friday was then scored for three groups of 15 items, 
and not for three groups of 20 items as was the case in the other experi- 
ments reported in this series. 



Results 

The means and standard deviations relevant to the analysis are 
presented in Table 7*01. There is certainly no trend comparable to the 
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TABLE 7.01 

MEANS AND STANDARD DEVIATION FOR SCORES ON 
ACHIEVEMENT TEST ADMINISTERED FRIDAY 







0 right 


1 right 


2 right 


3 right 


4 right 


Means by 
Days 


Monday 


M 


1.46 


1.12 


1 .01 


1 .05 


1 .32 


5.96 




SD 


-.83 


.88 


.85 


.85 


..86 




Tuesday 


M 


1.46 


1 .38 


1.57 


1 .17 


1.15 


6.72 




SD 


.90 


.87 


.95 


.92 


.85 




Wednesday 


M 


1 .09 


0.99 


1 .30 


1.22 


0.95 


5.55 




SD 


.94 


.86 


.86 


.87 


.90 




Means by 
number of 
right 
responses 
observed 




3.99 


3.49 


3.88 


3.44 


3.42 
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trend evident in the previous study. Indeed, if there is any trend at 
all it is in the reverse direction with the greater amount of learning 
occurring under the condition where the right answer was observed 0 times 
dur i ng the series , 

An analysis of variance was undertaken in order to test whether the 
five different learning conditions had a differential effect on learning. 

The analysis of variance is shown in Table 7.02. Signfiicant differences 
were found between days, between learning conditions, and for the inter- 
action of days and learning conditions. An examination of the table of 
means to determine, if possible, the nature of the significant effects, is 
not particularly helpful. The lack of orderly arrangement of the means in 
terms of magnitude suggests that the main effect might be due to dif- 
ferences between item groups which did not appear in the data on the basis 
of which the items were assigned to learning conditions. For some unknown 
reason, this particular group of subjects may have found certain groups of 
items more easily learned than others and the main effect of learning 
conditions may have been produced by an uncontrolled condition of this kind. 

The interaction is presented in graphical form in Figure 7.02. The 
graphical representation suggests that on the first day of learning, 

Monday, that the items which were answered correctly every time or zero 
times were the most readily learned, while on the second and third day 
the items answered correctly once, twice, or three times were the most 
readily learned. The interaction may represent a genuine effect and 
reflect a tendency for subjects to learn how to learn this particular task. 
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TABLE 7.02 

ANALYSIS OF VARIANCE FOR TEST SCORES OF PUPILS 
LEARNING BY OBSERVATION 



Source 


df 


Ss 


Ms 


F 


P 


Between Subjects 


148 


197.08 








Within Subjects 


2086 


1592,4 








Between Days 


2 


20.93 


10.47 


14.30 


<01 


Between Learning 
Conditions (0-4 
right) 


4 


14.46 


7.23 


9.90 


<.01 


Days X 0-4 


8 


39.55 


4.94 


6.75 


<.01 


Residua 1 


2072 


1517.3 


.73 
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Mean 

Score 



1.6 
1.5 
1 .4 
1 .3 
1 .2 
1.1 
1 .0 
0.9 




Fig'. 7.02 Graphical representation of interaction from 
Table 7.02. 
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The Learn! nq of the Stooges 



The data derived from the stooges is of some interest in that, while 
they responded to the situation, the response made was as often wrong as 
right. Table 7*03 shows the means and standard deviations for the group 
of 23 stooges tested with the other pupils on the Friday of the week 
during which the task had been undertaken. 

The means show a systematic variation from the items that were 
responded to zero times correctly to the items that were responded 4 times 
correctly . The items from which the greatest amount was learned were those 
to which the stooge sometimes responded correctly and sometimes incorrectly. 
The mere fact that a correct response was made and that the experimenter 
said "that's right," does not in itself make for a favorable learning situa- 
tion. While the stooges had the same amount of information necessary 
for learning as did the pupils who learned by observation, they learned 
considerably less. When the means given in Table 7*03 are compared with 
the means given for the pupils learning by observation, they are found to 
be in all five instances at a lower level. The sign test involved in this 
comparison provides a significance of the differences at the .03 level. 

The data on the stooges give support to those who doubt the value 
of the response-reinforcement paradigm for planning or understanding the 
verbal learning of children. The reinforcement position would lead one 
to expect that the group of stooges that made the correct response four 
times and who were reinforced for so doing would learn the task the best. 

In actual fact they did not and the clear-cut trend is for the best 
learning to occur when some overt incorrect responses were deliberately 
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An Overview of Both Experiments 

The studies have not been able to produce any consistent evidence 
concerning the capability of pupils to benefit from positive and negative 
i nformat ion when the positive and the negative are equally informative. 
While the results of the first experiment are consistent with previous 
findings, they are not supported by the results of the second experiment. 
The inconsistency was hardly expected. 
















7.18 



il 



References 

Bruner, Jerome S., Goodnow, Jacqueline J., and Austin, George A. A 
Study of Thinking . New York: John Wiley & Sons, Inc. 1958. 
Hovland, Carl I., and Weiss, Walter. Transmi smi ssion of information 
concerning concepts through positive and negative instances. 
exp . Psychol . . March 1953, ^5, 175"l82. 




CHAPTER VIII 
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Introduction 



During the present decade there has been a revival of interest in the 
function of education in developing creative talent, an interest which 
had reached a peak a quarter of a century earlier under the sponsorship 
of the progressive education movement. The typical position taken is 
that creative abilities are trainable in much the same way as are other 
abilities and that the exercise of these abilities coupled with approp- 
riate rewards will promote development. Some have even proposed that 
individuals can learn specific techniques v/hich will improve their 
creative performance. Such proposals are attractive to the promoters and 
their followers but they lack any kind of theoretical justification. The 
purpose of this paper is to present a theoretical position regarding the 
nature of creative abilities and to test empirically some of the implica- 
tions of this theory for training. 

The study was concerned with determining the tra inability of children 
in the capacity to produce original responses. The general procedure was 
that of measuring the ability to produce original responses before and 
after the training procedure and then comparing the scores on the two 
measures. The assumption underlying the present study is that the 
production of an original response to a task such as the Guilford 
Unus.uaJ Us^s Test depends upon the previous establishment of an associa- 
tion between the element of the task that constitutes the basic stimulus 
and the unusual use that is given as a response. The assumption is that 
the individual does not have the capacity to generate such an association 
in the testing situation itself, but only that he can utilize the 
associations already established. While he may have much greater access 
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to such weak, but previously-established, associations under some condi- 
tions than under others, one would not expect that that training would 
have any effect on the ability to produce original responses in the 
task situation under consideration, for the repertoire of such associa- 
tions is strictly a function of previous training. Furthermore, 
accessabi 1 i ty to such associations would not be expected to be amenable 
to training since accessabi 1 i ty is postulated to be a function of the 
degree to which environmental influences inhibit or facilitate the 
production of such responses. Nevertheless, there is still the possibility 
that any training effects that are found may be primarily motivational, 
that is to say, training may not increase the number of associations 
available and neither may it increase the person's access to these 
associations but it may result in him wanting to produce, (not generate) 
more remote associations (i.e., original responses). In such a case an 
experimenter, who during training becomes a source of motivation for 
producing original responses, may continue to be such a source of 
motivation when he administers the criterion tasks after training. The 
motivation theory appears to be the most plausible one at the present 
time. The studies covered in the review of the literature that follows 
give some support to this position. 

REVIEW OF RELATED RESEARCH 

Studies of Attempts to Train Or i qi na 1 Behavior 
i n Laboratory Si tuat i ons 

Maltzman ( 1958 , et al .) was interested in the conditions that 
facilitate the production of uncommon or original responses. These 
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responses are assumed to be lower in the response hierarchy and to have a 
weaker exci tatory potential than common responses. The procedure employed 
in his first study was an attempt to arrange a situation in which subjects 
would be induced to give responses low in the response probability 
hierarchy. The study further experimented with verbal reinforcement of 
uncommon responses, and instructions to be original during training. 
Subjects (N=120), university psychology students, were all orally presented 
with an initial stimulus word list for training. They were given the usual 
free association instructions to respond only once as quickly as possible 
with the first word that came to mind. They were then divided into three 
groups. The control group was then given a test list. Two experimental 
groups were presented again with the training word list four more times 
and instructed to give a different response each time from the ones given 
previously. At the end of the training period the control groups were 
then given the test list. One of these experimental groups was adminis- 
tered a verbal partial reinforcement schedule of "good*' by the experimenter 
for approximately every fifth uncommon response. The Guilford Unusua 1 
Uses Test was administered to all subjects after the last repetition of 
the training list. Before the admi ni strat ion of the Unusual Uses Test 
half of the subjects in the three groups received special instructions to 
be as original as possible and half were not. Results with the test list 
confirmed the hypothesis that repeated presentations of the initial list 
with instructions to give different responses each time would increase 
the uncommonness of test responses. Almost identical results were obtained 
for the two experimental groups (with and without reinforcement), indicat- 
ing that the verbal reinforcement employed did not influence uncommonness 
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of responses. The reinforced experimental group gave significantly more 

if ' **■ 

original responses than the control group (P— .02 level of conf idence) whi le 
the difference between nonrei nforced experimental and control groups was 
in the same direction but not at an acceptable level of confidence. 
Significant differences were found on the Unusual Uses Test in fa^l/or of 
experimental subjects receiving instructions to be original on the test 
(P=.01 level of confidence). 

The final conclusions drawn from the study indicate that originality 
of word associations can be facilitated by practice in free association 
as well as by instructions. Verbal reinforcement of uncommon associations 
did not produce significant increases in originality. Maltzman stated 
that: "It is possible that a greater number of reinforcements and/or a 

different reinforcement schedule would produce a more pronounced effect 
than the one obta i ned .. .These effects, however, require further confirma- 
tion before they can be interpreted unambiguously." 

An additional problem concerning reinforcement presented itself 
within the context of the foregoing experiment. In this latter study 
an experimental group had been verbally reinforced for unusual responses 
during the training phase, but this procedure had failed to produce an 
effect on the test series. On the other hand, the increase in the 
production of original responses is indicative that some sort of reinforce- 
ment was possibly operating, although not under experimental control. It 
might have been that the verbal reinforcements used simply failed to 
contribute to the reinforcements that were already operating in the 
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Maltzman (I960, ^ aj^.) has reported a series of several additional 
laboratory studies indicating positive results in attempts to increase 
the production of original responses or to increase the originality of 
responses. In the first study of this series, university psychology 
students used as subjects (N=292) were administered a training list of 
twenty-five words taken from the Thorndi ke-Lorge count (19^^) to which 
they were to free associate. They were all given the usual free 
association instructions to respond as quickly as possible with the first 
word that came to mind. Following this procedure, the subjects were 
divided into two control and three experimental groups. One contrdl 
group was given a test list of twenty-five new stimulus words with the 
same instructions as given for the training list. A second control group 
was given five additional presentations of the same training list and 
instructed each time to try and give the same response to a given 
stimulus word each time. One experimental group was given five repeti- 
tions of the new test list and instructed to try and give a different 
response to a given stimulus word each time. Two other experimental 
groups were each given different test lists of 125 different words 
following the initial training list with the same instructions to free 
associate and respond with the first word that came to mind. One group 
received a list of words with a low frequency count, from the Throndi ke- 
Lorge count of words occurring not more than six times per 1,000,000. 

The other received words with a high count; that is, from among the 
500 most frequently occurring words in the count. All subjects were 
given the Guilford Unusual Uses Test immediately after the completion of 
the free association test lists. Results indicated that the experimental 
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group given instructions to respond differently to each repetition of 
each word in the test list was significantly more original than the 
control groups (P=.001 level of significance). A simple analysis of 
variance of the original responses obtained from the Unusual Uses Test 
showed that the experimental groups receiving five repetitions of the 
test list with instructions to give a different response each time gave 
significantly more unique responses than any of the other four groups 
(P=.001 level of significance). None of the groups differed signifi- 
cantly in terms of their fluency of common responses on the Unusual Uses 
Test . A basic assumption in the use of the training procedure employed 
with the one experimental group was that instructions to be different and 
original each time when responding to repeated stimulus words would 
increase the uncommonness of responses. Results showed that this train- 
ing method had the expected effect. 

The second experiment used university psychology students (N=251) 
who were again presented an initial training list and final test list of 
stimulus words with the free association instructions as before. The 
stimulus word list used for training and the subsequent test were the 
same as those of the last experiments; however, the two lists were 
reversed so that what was previously the training list was now the test 
list. Guilford's Unusual Uses Test was administered after the completion 
of the free association test list. The control group received the 
single training list followed by the test list. Four experimental groups 
received different treatments as follows: experimental group I received 
five repetitions of the training list with instructions to give a 
different response for each stimulus word each time; experimental group 
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II was asked to respond differently to five repeated presentations of 
six items from the Unusual Uses Test ; a third experimental group was 
presented with 125 psirs of words and was instructed to underline the 
number of the pair matching more readily the stimulus word presented 
orally from the training list; and a final experimental group received 
the same response pairs as gCoup III but without the stimulus word but 
was instructed to> underline the member of each they thought to be more 
familiar. Results obtained in this experiment confirm the results of 
the first experiment in that experimental group I differed significantly 
from each of the other groups (P=.001 level) on mean originality scores 
on the free association test. The Unusual Uses Test was administered 
to all groups after the completion of the training procedure. Repeated 
evocation of different responses to free association stimulus is followed 
by an increase of originality of responses to new stimulus words and to 
items on the Unusual Uses Test as compared to a group not receiving such 
training. However, training with material similar to that provided by 
the Unusual Uses Test produced no effect . 

A third experiment by Maltzman (I960) was conducted to determine 
the effect of amount of training on a free association word list. 

Amount of training can be varied by either increasing the number of items 
with which to free associate, or increasing the number of repetitions of 
the same items presented. Both variations were employed in this experi- 
ment. Subjects, ( 162 ) • i ntroductory psychology student^ were given the 
same initial training and final test consisting of a free association 
word list as in experiment II. All experimental groups received single 
presentations of the training and test lists. However, prior to 
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receiving the criterion free association test list four groups were 
exposed to different experimental treatments. One group received ten 
repetitions of six items from the Unusual Uses Test while a second group 
received twenty repetitions of the same items. Two more groups received 
six additional items from the Unusual Uses Test repeated either five or 
ten times. Thus, treatments consisted of either ten or twenty repetitions 
of six Unusual Uses Test items or five or ten repetitions of twelve 
similar items. No evidence of a facilitating effect of training by 
repeated evocation of different responses to the same stimuli was obtained. 
It was likewise fourid that no significant differences occurred with 
varying the number of unusual uses items on the training list prior to 
the free association test for originality. There was no significant 
change in uncommonness of responses on the free association test. 

Maltzman ^ a^. concluded that repeated presentations of unusual uses 
items were ineffective because they did not produce large enough increase 
in uncommonness of responses evoked during training. Therefore, a fourth 
experiment was conducted using the free association situation for train” 
ing since reliable effects upon originality had previously been obtained 
under these conditions. 

Experiment IV of the Maltzman (I960) series again utilized (N*=201) 
introductory psychology students. Two new stimulus word lists used for 
initial training and free association testing were drawn from Woodworth 
and Wells (1911) on the basis of their tendency to evoke opposttes. 

There were twenty-four words in the training list and ten words in the 
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The Guilford Unusual Uses Test was administered after the test list. 
The control group received the test list of 10 new stimulus words follow- 
ing completion of the initial twenty-four word training list. Three 
different experimental groups received one, five, and ten repetitions of 
the training list prior to the presentation of the test list. Results 
indicated no significant differences between experimental groups. Result 
from the free association test of originality indicated that each of the 
three experimental groups gave more original responses than the control 
group (P-.OOl). The results of tests indicated that the three experi- 
mental groups did not differ significantly from each other on the Unusual 
Uses Test . Each of the three experimental groups gave significantly more 
unique response on the Unusual Uses Test than the control group (P=.05 
for the five repetitions group versus control, and P=.001 for the other 
two experimental groups versus control group). A fifth experiment was 
conducted by Maltzman (I 96 O) in which he hypothesized that if originality 
can be learned, according to the principles of instrumental conditioning, 
then it should show some degree of persistence. Subjects (N=77) 
university psychology students, were administered stimulus word lists 
from the Thorndi ke-Lorge count as in studies I, II and III previously 
reviewed. In addition to the initial free association training list, a 
test list and the Guilford Unusual Uses Test were administered. All 
subjects received the training and test lists. Two control groups 
received the free association test and Unusual Uses Test I 48 hours 
following training. Two experimental groups experienced comparable 
delays but, as in the previous experiments received training by receiving 
five repetitions of the training list with instructions each time to 
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respond differently to each stimulus word and to be original. Significant 
differences were obtained for the originality training effects of the 
experimental groups as compared with the control groups (P=.001 level) on 
the free association test following training. Significant training effects 
after delays from one hour to two days on the free association test of 
originality were obtained (P=.05 level of significance). There was no 
significant decrease in the number of unique responses when the delay 
was increased from one hour to two days. 

As a final conclusion to the series of five experiments reviewed 
herein, Maltzman has stated: "The results of the experiments lend some 

support to the hypothesis that originality is learned behavior and varies 
as a function of the same antecedent conditions as other forms of operant 
behavior. Effects of originality training may persist for as long as two 
days . . .ori gi nal i ty , at least on the free association test, also varies as 
a function of the number of repetitions of the training list." 

A few comments must be made on certain striking features of the 
Maltzman series of experiments. First, there is a certain oddness about 
the finding that free association activities; result in an increment in 
score on the Unusual Uses Test , but that training on unusual uses items 
does not. Certainly one would not expect a greater amount of transfer to 
occur from a task very different from the criterion task than from a 
closely similar task. Such a finding would be unique in the entire 
literature of l^.arning research. While Maltzman has an explanation to 
offer for this paradox, an alternative and much simpler explanation is 
that he was not dealing with a learning phenomenon at all, but rather his 
results should be interpreted as representing a facilitation phenomenon 
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similar to that manifested by a pitcher when he warms up in the bull pen, 

A second important point to note is that, throughout the Maltzman 
series of experiments, the experimenter was both the administrator of the 
training task and the administrator of the pre-test and training task. 
Under such ''onditions, the training tasks may help to define what the 
experimenter is requiring of the subject and, having learned what is 
required of him, the subject may show an improved performance on the 
criterion task. Under such conditions, the presence of the experimenter 
cues the performance of the subject on the criterion task. One suspects 
that if the criterion task were given by a different experimenter and 
if the criterion task were not connected in any way with the training task 
that the so-called training effect reported might well vanish. 

The puzzling nature of the Maltzman findings are rendered even more 
puzzling by the fact that Anderson and Anderson (I963) trained subjects 
in a situation requiring them to name as many uses as they could of 
objects and found that the training transferred to a Novel Uses Test . The 
fact that Maltzman found no effect of such training, while Anderson and 
Anderson (I963) did, can perhaps be attributed to the extended nature of 
the training in the latter study and the very limited training given in 
the former. It should be noted that Anderson and Anderson, like Maltzman, 
also used the same experimenter to run the training sessions and to 
administer the criterion tests, so the effect reported may well not be a 
genuine learning phenomenon. 
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Stud i es of Attempts to Train Original 
Behavior i n Academi c Si tuat ions 

Fames and Meadow (1959) investigated training effects of a 30-hour 
course in creative thinking using the so-called “brainstorming'* technique 
outlined by Osborn (1957)* The usual description of the technique is that 
of withholding or deferring judgment while producing ideas. Fames writes 
that: “The basic thesis of this technique (brainstorming) is that 

creativity is encouraged by the temporal segregation of hypothesis forma- 
tion and the judicial evaluation of the adequacy of hypotheses." The 
training course emphasized the brainstorming technique throughout with 
p re-* and post*trai ni ng devices measuring the variables of quantity and 
quality of ideas and three personality variables of need achievement, 
dominance, and self-control. Subjects (N-162) , university school of 
business day and night students, were equally divided in an experimental 
group and two control groups. Eleven measures comprised the pre and post 
assessment devices which were divided between the two control groups. 

One control group was administered six creative ability measures 
Including the Guilford battery and the AC Test of Creativity while the 
other was administered five personality measures including the Thematic 
Appercept i on Test and the Ca 1 i forni a Fsychology I nventory . Control 
subjects were matched with experimental subjects on age, sex, and the 
Wechsler Vocabulary Score. The experimentally trained group was 
administered all eleven measures by their course instructor at the begin- 
ning and end of the semester- s course . Results comparing the experimental 
and control groups indicated significant differences on both quantitative 
and qualtative measures of five creative ability tests in favor of the 
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experimental subjects (P=.01 level of significance). Fames and Meadow 
stated that: "Since the instructor carefully avoided practice on any 
objects even remotely similar to the type of objects which appeared on 
the tests, the results do indicate generalization of this training." 
However, the trained student did receive instruction and practice on types 
of problems similar to those included in the tests. Of the three person- 
ality measures used, the CPI Dominance sca.le indicated a significant 
difference, or an increase in Domi nance, of I the expet inienta 1 groups as 
compared to the control group (P=.05 level). The other two personality 
variables measured did not yield significant differences. The final 
conclusion of the study was that training in the technique of brain- 
storming produced a significant increment on measures of quantity and 
quality of ideas and on the personality variables of dominance. It is 
interesting to note that the same instructor initiated all tests as well 
as conducted the training program for the experimental group. An interest- 
ing question arises whether the difference in creative performance on the 
tests wtas attributable to practice or to an artifact in which subjects 
were responding to the tests the way they thought the instructor wanted 
them .to . 

An experiment designed by Fames and Meadow (I960) to evaluate the 
persistence of the effects of a course in creative problem solving 
emphasized the brainstorming technique. An experimental group that had 
taken a creative problem solving course eight months previously was compared 
with a control group of students who had Just enrolled in a similar course. 
Experimental and control groups (N=48) were matched for vocabulary ability 
on the basi s of the Wechsler Adult I ntel 1 i qence Vocabulary Test . The 
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experimental subjects had previously taken the course in creative problem 
solving at least eight months prior to the study and the control subjects 
were registered for the course but had received no training. Both groups 
were compared on six measures of creative ability which included the Guilford 
Apparatus Test . Unusual Uses Test . Plot Titles Test . and Possibi 1 it ies 
Tests and quantity and quality measures on the Uses Test from the AC 
Test of Creative Ability . No subject in either group had ever before 
taken these creative thinking tests. Results indicated that the experi- 
mental subjects trained eight months before outperformed the control 
group. Four of the six differences between the groups were statistically 
significant (Beyond the P^.05 level of significance). Differences between 
the experimental and control subjects for the Apparatus and Unusual Uses 
measures did not meet the .05 level of significance. The authors stated 
that: ’’Results indicate that increased productivity in creative thinking 

produced by the creative problem solving course persists for a period of 
eight months or more after completion of such training." 

These experimental studies have attempted to control and manipulate 
variables in a deliberate development of fluent and original thinking as 
factors of creativity. Only a few studies have been found that were 
designed to develop creative thinking in school classroom situations. Two 
such studies will be discussed because of their relevance to the present 
investigation. Hutchinson (1963) conducted four fifty-minute in-service 
teacher training seminars in which he instructed classroom teachers rn 
processes of divergent thinking. Likewise, students of these teachers 
were instructed in the group methods of brainstorming. Subjects, (N=256) 
seventh grade students, were divided into two groups, four experimental 
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and four control classes matched by mental age and sex. The four control 
classes were taught by teachers who did not participate in the training 
seminars and instructed their classrooms in the usual manner,. The four 
experimental classes were taught by in-service trained teachers who were 
shown how to modify their teaching methods to include classroom work 
involving divergent and evaluativ^e thinking. Ten measures of 
creative thinking and also subject-matter tests were administered towards 
the beginning and towards the end of the experiment to both groups. Both 
groups of classes were taught a two-week social studies unit on transpor- 
tation and communication. Tape recordings were made of all class sessions 
of both groups and classified according to the Aschner-Gal 1 agher Classifi- 
cation System, There was no significant difference between the two 
groups in the amount of gain shown on subject matter tests, but the 
experimental group had significantly greater gains in four of the ten 
measures of creative thinking (three at the P=.05 level of significance, 
one at the P~.01 level). These might be attributed to training though 
other explanations are possible. The author stated that by teacher 
training and modification of instruction methods for experimental classes, 
there was a distinct change in verbal response categories during classroom 
instruction. The experimental group decreased 30 percent in the routine 
response category, increased 20 percent in the cognitive-memory category, 
and increased 23 percent in the divergent-thinking and evaluation category. 

Knight (1963) also designed a study to measure the training effects 
on fluency, flexibility, and originality on a mixed grade of fifth and 
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sixth grade school children (N-28) . A control group in another school 
was matched with the experimentally trained group on age, intelligence, 
sex, general scholastic ability, and occupational level of the family 
wage earner. 

Seven tasks were administered to both the experimental and the con- 
trol groups both before and after the training program and from these 
tests fifteen scores were derived which could be combined into measures 
of fluency, flexibility and originality. Training consisted of exercises 
involving psychological operations closely similar to those involved in 
the evaluation devices. Some of these training tasks were performed 
by pupils working alone but some required group activity. The training 
program extended over the school year. Knight was the teacher of the 
group exposed to the special training and was also the person who admin- 
istered the tests at the beginning and end of the school year. Signi- 
ficantly greater gains were found for the experimental group over the 
control group on 14 of the 15 measures of creativity at levels of signi- 
ficance which ranged from .05 to .001. Achievement tests also given at 
the start and finish of the school year indicated that any gains made 
on the creativity tests were not made at the expense of the commoner 
forms of the school achievement. 

The studies of attempts to train original responses leave the 
reviewer with considerable discomfort concerning the interpretation to 
be given to them. First, there is the disconcerting fact that, without 
exception, the same experimenter served as trainer and as test adminis- 
trator. The possibility remains that training may have taught the subjects 
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to produce a certain kind of response for the experimenter. The subjects 
have learned not so much to be creative but to do what the trainer expected 
of them. The results of this might be that when they came to the final 
criterion test they knew better what was expected of them than they had 
when they took the initial test. In a sense, the effect of training 
may have been that of giving subjects a better understanding of what was 
expected of them when faced with a test of creative ability. Much the 
same effect as that produced by training should be produced by using 
suitable directions on the test. This hypothesis can be examined since 
a number of studies have now been conducted which investigate the effect 
of administrative directions on performance on such tests. These studies 
must now be examined. 

Studies of the Effect of Administrative Pi rections on 
Performance of Tasks Requi ring Creative Behavior 

Chr i stensen ^ aj_. , (1957) experimented with fluency and originality 
on tasks emphasizing inventiveness. It was hypothesized in this study 
that production of original responses on an open-end test calling for 
creative thinking would be relatively constant across time. It was 
further hypothesized that more original responses would come later in 
the production period, and that with instructions to be clever the total 
quality of responses would be reduced while more clever responses would 
appear. Each of these hypothesis were treated in separate parts of the 
study. Criterion performance measures consisted of the Plot Titles Test , 
an impossibilities test, and the Guilford Brick Uses Test . Subjects 



(N=400) were Air Force cadets and college students. The relation of 
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number of responses to time for the written creative thinking tests used 
showed less decrement with time as compared to verbatim-recall tasks. 
Uncommon responses increased in frequency with time. The relation of 
remoteness of responses to temporal order of occurrence showed signifi- 
cantly greater numbers in the second-half over the first-half of the 
test performance (p=.01 level of confidence). The increase of cleverness 
of responses with time did not occur as predicted. When two plot titles 
tasks were administered to two groups with instructions to one group 
to give only appropriate titles, and to the other to give appropriate, 
clever and novel titles the following results were obtained: A signifi- 

cantly greater number of clever responses were produced by instructions 
to give clever and novel titles (p=.01 level) even though there was a 
smaller average total number of responses produced under this condition. 
As expected, fewer total responses were produced when subjects were 
instructed to be clever. When these were compared to a group of subjects 
receiving no special instruction to be clever, the correlation between 
number of clever responses under instructions to be clever and no instruc 
tions was .62. Christensen stated: “It appears that individuals are 

more or less disposed toward making clever responses whether or not they 
are explicitly instructed to do so.“ Other studies have shown a decrease 
in quantity of total output under instructions to be original or clever. 
This effect is probably due to self-imposed censorship of the subject 
which facilitates rigid limits of rejection for so”called non*^clever 
responses . 

Meadow, Fames and Reese (1959) designed another study to measure 
subjects' ability to solve a problem requiring creative ability either 
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under instructions to express all the solutions they could without 
evaluation (brainstorming) or with instructions to express only solutions 
of good quality and which involved a penalty for solutions of bad quality 
The penalty was the subtraction of ideas rated ‘'bad*' from the total, and 
this information was given to the subjects. The quality of the solutions 
was later judged by a trained rater. Subjects were (N=32) college 
students who had been enrolled in a course in creative problem solving 
which stressed the brainstorming procedure for problem solutions. Sub- 
jects were randomly divided into four experimental groups. The problems 
were taken from the ^ Test of Great i ve Ability . One fourth of the sub- 
jects was given one of the two problems either under brainstorming or 
nonbrainstorming instructions. Each response was rated for uniqueness 
and for value on a 3“Point scale. A response was designated as "good*’ if 
the combined uniqueness and value score was above five. The number of 
good responses was then expressed as a percentage of the total number of 
responses which constituted each subject ‘s creativity score. Results 
showed that significantly more good solutions were produced under the 
brainstorming instructions than under nonbrainstorming instructions (P= 
.01 level of significance). Significantly more good solutions were 
produced when brainstorming instructions were given in the first test 
period than when given in the second test period following nonbrain- 
storming instructions (.05\P^ .02). One main point should be emphasized 
with respect to interpretation of the study; all subjects were initially 
trained in the brainstorming technique. Because of this factor another 
study was devised using nontrained subjects. 
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Parnes and Meadow (1959) designed this second study to determi ne the 
effectiveness of brainstorming instructions with untrained subjects and to 
compare the data thus derived with the data from the subjects previously 
trained in brainstorming. University undergraduate students (N=52) were 
used. Grade poi nt average was used for matching formerly trained subjects 
to untrained subjects. Trained subjects differed from untrained subjects 
in havi ng been exposed to a one"semester course in creative problem 
solving utilizing the brainstorming principle. The same two problems from 
the AC Test of Creative Abi 1 i ty were used to provide the criterion scores 
as in the previous study. The untrained subjects were given one problem 
with the second following immediately thereafter; each problem was 
administered either under brainstorming instructions or nonbrainstorming 
instructions. The same procedure was followed for trained subjects. 

The data showed that more good ideas were produced under the brainstorming 
instructions than under the nonbrainstorming instructions, and this effect 
was greater in the second test period than in the first. Subjects 
formerly trained in a course emphasizing the brainstorming technique 
produced a significantly greater number of good quality ideas when using 
the technique than did untrained subjects (P=.01 level of significance). 

An additional result reported a positive correlation between quantity 
and quality of ideas either under brainstorming or nonbrainstorming 
instructions (rs*63 to .81). The point to be made here is that with 
eithferrspectf Tc instructions to be fluent or original under brainstorming 
or nonbrainstorming instructions some subjects were able to produce 
original ideas. 
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Another study by Fames (I 98 I) involved two experiments designed to 
test the hypothesis that extended effort in idea. production during a single 
session will lead to an increasing proportion of good ideas. Both experi- 
ments used university undergraduate students divided into two groups. 

Group. one (N=l46) comprised untrained students who were presented the 
Hanger problem from the AC Test o£ Creative AbilLty . Instructions were 
given these subjects to think of all possible uses of an ordinary coat 
hanger in the time limit of 5 minutes. Scoring yielded a quality score 
defined as uniqueness and unconventionality; and unduplicated responses 
were counted for a quantity score. Interrater reliability for ratings of 
fifty subjects selected at random was .74. Each subject's data wais 
divided into halves on the basis of total number of ideas and an analysis 
of variance was used to determine differences in quality of ideas between 
the first half of each subject's ideas and the second half. Results 
indicated highly significant differences (p=.001) between halves in favor 
of the second half. A second group (N=42) of students that were enrolled 
in a creative problem solving course which consisted of training and 
practice in brainstorming was tested. Near the completion of the course 
they were presented the Hanger problem as in experiment 1 above with treat- 
ment time extended to 15 minutes, and it was again scored for quantity 
and uniqueness of responses. The mean for the last third was signifi- 
cantly greater than the means for both the first third (P=.01) and middle 
third (P=.05) of the list. The means for the first and middle thirds did 
not differ significantly. 

The data that has been presented up to this point i ndi cates that the 
directions given in administering a test of creativity, and the 
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interpretations given to them by the subjects are important factors in 
determining the general level of the scores. This effect is so substantial 
the possibility exists that the "training effects" reported by Parnes and 
others may well be a result of a changed orientation towards the criterion 
test produced by training rather than a genuine learning effect. Sn this 
connection a study by Gerlach, Schutz, Baker, and Mazer (1964) throws some 
light on the matter. in this study, two tests of creativity were adminis- 
tered with differing administrative directions. Altogether six different 
sets of directions were used which included the standard directions, brain- 
storming directions and what were referred to as "criteria cued" directions. 
The latter directions spelled out in considerable detail the performance 
required on the test. This set of directions yielded higher scores on the 
test than a trained group previously reported by Parnes (1961). The 
students in the Parnes experiment who had been trained in brainstorming 
and who were given the test with brainstorming directions had a slightly 
lower mean than the untrained group in the Gerlach, Schutz, Baker, and 
Mazer study. The data suggests that the orientation of the subjects 
towards the criterion task may be the crucial factor operating in most of 
the experimental studies of training in creativity that have been reviewed 
in this chapter including those of Maltzman, Another very important 
finding to note is that, when very complete directions were given, the 
subjects showed no tendency to produce the more original responses in the 
later part of the testing time. This finding negates the notion that 
practice during a single session produces some mysterious learning effect. 

It appears to be the task orientation which determines, to an important 
degree, the order in which responses will be evoked. 
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An Overview of Studies on the Training of Creative Talent 

The review of the literature which has been undertaken reveals an 
odd inconsistency between the state of knowledge in the field and current 
educational practice. A brief inspection of the course offerings of 
institutions concerned with the training of teachers reveals that courses 
on the development of creative talent in children are widely offered as 
though there were an extensive and wel 1-devel oped body of knowledge to trans- 
mit. Our review of the literature shows that the area is one which has 
stimulated experimentation and exploration, but that the outcome of most 
research is a set of controversial conclusions. Those who have engaged in 
research in this area cannot be blamed for the lack of definitive findings. 
Attacks on new areas at the frontiers of knowledge typically produce what 
appear to be inconsistent findings until experimental techniques are evolved 
capable of producing stable results. One is reminded of the fact that 
research on learning has had to have taken place over almost an entire 
century before laboratory techniques emerged which yielded highly consistent 
results. Only in the last fifteen years has knowledge of learning phenomena 
developed to the point where laboratory demonstrations can be givin in front 
of entire classes of students and with assurance that learning will occur 
exactly as predicted. Such control over behavioral phenomena is evidence 
of a mature science. The area of research on creativity is at the beginning 
of its history and the knowledge which such research has produced must be 
considered to be highly uncertain. One should, perhaps, regard it as know- 
ledge suitable for stimulating the future work of scientists rather than 
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research which has direct applications to problems of education. These 
introductory comments serve to suggest that the study which follows is 
not intended to provide any immediate solution to an important educational 
problem, but serves the purpose of helping to determine the nature of the 
task of developing creative talent. 

THE PROBLEM 



The purpose of the present study is to provide relatively prolonged 
training in the making of original responses and then to evaluate the extent 
to which the training is effective. A major difference between the present 
study and previous studies is that care has been taken in the present study 
to eliminate from the evaluation procedures certain conditions which have, 
in previous studies, appeared to produce spurious effects. Specifically, 
the use of the same experimenter both for the presentation of the training 
materials and for the administration of the evaluation tasks has been replaced 
by a procedure in which the experimenter administered the training tasks, but 
the teacher administered the evaluation instruments as a part of the regular 
classroom routine. Thus any spurious effect which the experimenter could 
have on the procedure is eliminated. In addition, fairly prolonged training 
Is given involving in the one case the practice of original responses and 
in the other practice together with reinforcement. The expectation was 
that the training effects which have appeared in previous studies would 
vanish under the better controlled conditions of this study. 
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METHODS AND PROCEDURES 

This chapter presents the experimental procedures of the study In 
terms of: a) subjects; b) training tasks — their characteristics, 

administration and scoring procedures; c) criterion tasks — their 
characteristics, administration and scoring; d) training conditions of 
reinforcement versus nonreinforcement; and 3) the design. 

Subjects 

Those who participated as subjects in the experiment were 209 
elementary school students (110 girls and 99 boys). All Ss were sixth 
grade students from three elementary schools. Two of the schools, are 
located in Bountiful, Utah, a suburban residential area north of Salt 
Lake City. Three sixth grade classes in one of these schools were 
trained for fourteen days by an experimenter on a task involving creative 
abilities and were reinforced by a partial reinforcement schedule each 
day of training. These Ss are referred to as the reinforced experimental 
trained group and are designated by their respective school room numbers 
of 10, 11 and 12. Three more sixth grade classes at the other school 
were likewise trained for the same fourteen days but were not reinforced 
during training. These Ss are referred to as the nonrei nforced experimen- 
tal trained group designated by their respective school room numbers of 
22, 23, and 24. Criterion tasks were administered during, after and 
two weeks following training by the regular classroom teacher in all six 
classrooms of both experimental trained groups. A third school, located 
in an old residential section of downtown Salt Lake City, was used as the 
control group for certain purposes. Three sixth grade classes at the 
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latter school were administered only the criterion tasks by an experimenter 
over the same time interval as the experimental trained groups. This 
group of Ss (N=4y) served as a control group not receiving training and 
is designated as control group classes 1, 2 and 3. 

Neither of the two experimental schools grouped children in a 
grade with respect to intelligence or achievement. The range of intell- 
igence measures for this sample of elementary school students as subjects 
was from 8l to 146. The three classes of the control group were grouped 
by intelligence scores into two classes: a gifted class (room 3) made up 

of students with intelligence test scores over 110, and the other two 
rooms (1 and 2) made up of the remaining students. Table 8.01 indicates 
the mean and standard deviations of the intelligence test scores by class 
measured by the California Mental Maturity Test which had been administered 
to all Ss in the fifth grade. The table reports data for only those pupils 
used in the analysis of the data. The basis for this selection will be 
described in later sections. 

At the completion of the study, test data on all Ss who had been 
absent on any one of the counterbalanced training days or on any one of 
the criterion testing days was eliminated. The remaining Ss were matched 
as nearly as possible across reinforced and non- reinforced groups by 
intelligence test scores. Complete test data on a total of S6 Ss who 
were matched by intelligence scores and had been present throughout the 
complete five week training and testing program then remained as the 
experimental sample for analysis. 
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TABLE a. 01 

INTELLIGENCE TEST SCORE MEANS AND STANDARD 
DEVIATIONS FOR EXPERIMENTAL AND CONTROL GROUP CLASSES 
FOR CASES USED IN THE STUDY 





N 


Mean 


Standard Deviation 


Reinforced Experimental 
Trained Group* 


Rooms 10 


16 


114.25 


15.04 


11 


17 


121 .11 


11 .44 


12 


15 


113.87 


12.38 


Nonrei nforced Experimental 
Trained Group* 


Rooms 22 


17 


120.70 


12.18 


23 


13 


113.15 


14.36 


24 


18 


114.94 


12.96 


Control Group 


Rooms 1 1 


15 


91 .90 


10.3 


2 


18 


90.53 


15.1 


3 


14 


118.64 


9.9 



^California Mental Maturity Test 
**Pinter Verbal Intelligence Test 



Train! ng Tasks 



Task Character! sties 

The training task had a form similar to the Guilford Uses Test (1962). 
It consisted of a list of ten names of common things (objects) presented 
in print form on paper. This task is referred to as the training word 
lists. Fourteen training word lists, one for each day of training, were 
prepared on multi lith forms for the study. The forms were prepared in a 
manner so as to differentiate them from the “ditto'^ form of the teacher 
controlled criteria tasks. These training word lists presented by the 
experimenter were purposely made to appear distinct from a classroom 
exercise. Even though fourteen different word lists were used, three of 
these were counterbalanced across the three classes of both the reinforced 
and nonrei nforced experimentally trained groups on the 7th, 11th, and 
l4th days of training. Printed instructions on each word list directed 
the Ss to find a different use for each thing, to be original and 
creative, and to use their imagination. 



Admi ni strati ve Procedures 

One list of ten names of common things which comprised the training 
word list was presented each day for the fourteen training days by an 
experimenter following a short orientation procedure. Subjects were 
instructed to think of one unusual use for each object. The following 
training instructions were given each day by the experimenter before 
administering the task: 

This is an exercise to see how well you can use your 
imagination for thinking up unusual uses for common things 
which are all well known to you. Try to think up a different 
use for each thing than the one you think your neighbor would 
put down. Be original, do not suggest a use for the thing 
which most people would think of. You are to think up one 
unusual use for each thing and write it down as soon as it 
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comes to your mind. Work rapidly, but try to think up 
a use for the thing in a way that it was not originally 
intended to be used. If you cannot think of a different 
use for the thing after a short time, skip it and go on 
to the next thing. Be brief, do not worry about your 
spelling or complete sentences. Just write the first 
unusual use for the thing that comes to your mind. 

The orientation procedure preceded the administration of each train- 
ing word list and was standardized across all six experimental classes at 
the two schools. The experimenter read the training instructions aloud 
each day and then picked some item at random in the room; such as a 
blackboard eraser, or a pencil, etc., to use as an example. He would then 
ask the class for their ideas on how they might use the object differently 
while pointing out its inherent attributes or qualities, and informing 
Ss to think of using its different parts or the whole object in some new 
or unusual way. An example of the typical training orientation used by 
the experimenter is as follows: 

Try to think of a different use for this thing than 
the one you think your neighbor would put down. Be original , 
do not suggest a use for the thing which most people would 
think of. Some things have their common use within the name 
of the thing such as tooth brush. The common use would be 
to brush your teeth, but try to think of a new use for a 
tooth brush such as to apply paint in cracks. You may use 
all or any part of the thing, or change its form. For the 
tooth brush you may think of removing the bristles and using 
its plastic handle as a tool for working with clay. Other 
things do not have their use implied within their name, 
such as a brick. Here you can be as original as you want. 

How would you use it differently? Here are some examples 

Number 1. brick powder it and make red 

paint 

Number 2. newspaper to make a kidnap note 

Number 3 • paper clip as a fish hook 

You are to think up only one unusua 1 use f or each thing 
and write it down as soon as it comes to your mind. Work 
rapidly but try to think of a use for the thing in a way that 
it was not originally intended to be used. If you cannot 
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think of a different use for the thing, skip it and go on 
to the next one. Be brief, do not worry about your spelling, 
just write the first unusual use for the thing that comes to 
your mind. 

Are there any questions? 

Each class was allowed to run-off ten to twelve ideas orally before 
proceeding on to the written training task for the day. The training 
procedure and time for handing out papers took about 8 minutes for each 
class. A strict time limit of 5 minutes was imposed for each training 
word list. At the end of this 5 minute period, time was called, all the 
papers were collected, and the experimenter moved on into the next class- 
room . 

Rei nfor cement 

A verbal reinforcement schedule was used for the three classes that 
constituted the reinforced experimental trained group. At the first of 
every training period, following the first day, the training word list 
papers of the preceding day were returned to each S. These had been 
scored and some of the responses were reinforced with "excellent," "very 
good," or "good." These words were stamped in red alongside those ideas 
judged for their originality. Since no scoring guide was available for 
the training task, the basis for choosing responses to be reinforced was 
the infrequency of the responses among the population sampled. One of 
the members of the Bureau of Educational Research staff daily reinforced 
ten percent of those responses determined to be original in terms of 
infrequency. The most infrequent, relevant responses in the population 
sample were reinforced with an "excellent." The next most infrequent 
relevant responses were reinforced with a "very good," and the third most 
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infrequent responses with “good.** 

In this manner, each S of the three reinforced classes was given 
daily knowledge of results on how original his or her ideas were the day 
before. The pupils responded enthusiastically to the daily return of their 
papers. These papers were first distributed and then collected before the 
training word list for the day was handed out. The initial training 
procedure was then carried on by the experimenter. The Ss of the non- 
reinforced classes were given no knowledge of results on their daily 
training performance since their papers were not returned each day. All 
teachers remained in each classroom during every training period either 
observing the procedure in the back of the room or working at their desks. 

Scor i nq 

The responses to the training word lists for the experimental 
subjects were scored for originality. The forms containing the answers 
were coded by number and all names and any reinforcing statements 
stamped on them were removed. This was done to eliminate class or 
treatment identity to prevent contamination of the results. 

Training task word lists for the first day of training were 
distributed evenly among eight members of the Bureau of Educational 
Research staff for practice scoring along with a set of scoring 
instructions. The scoring procedure consisted of three steps. First, 
scorers were to eliminate all irrelevant responses, or those that were 
irrelevant. Of the remaining responses, scorers were asked to choose 
ten percent of the most infrequent and unusual uses for the items 
listed. From the ten percent chosen as original responses, scorers were 
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then asked to select one third of these as the most unusual or creative 
responses. The procedure for scoring a response as original was adapted 
from Torrance (1962) and was based upon the infrequency of the response 
in the sample. 

After the papers from the first day of training had been scored once, 
they were collected and again redistributed for a second scoring among the 
same eight scorers. In this manner all Ss training papers for the first 
day of training were scored twice for originality. Interscorer contin- 
gency coefficients between scorers for individual items ranged from .11 
to .36 which were all significant at the .001 level of confidfence except 
one at the P=.05 level. The over-all contingency coefficient for all data 
combined was 0.22. 

After this practice scoring procedure the eight scorers set about 
the task of scoring the counterbalanced training word lists administered 
on the 7th, 11th, and l4th training days. The procedure was adopted of 
coding the papers, shuffling, and then distributing them among the eight 
scorers. After these had been scored once for originality, scoring sheets 
were removed and replaced with new ones after the papers had been reshuffled. 
They were again distributed for a second scoring. 

Cr i terion Tasks 



Task Characteri sties 

Teacher administered criterion tasks were used for evaluating 
transfer from the experimenter training exercise to the classroom teacher 
situation. Two types of criterion tasks were utilized, l) The first was 
a written task called the Ask-and-Guess Test (Torrance, 1958) - a paper- 
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and-pencil test containing three parts, each requiring the student to 
respond differently while viewing picture prints from Mother Goose stories. 
Three 35mm colored slides were projected on a screen, one for each test 
period, for group, classroom testing. The first part of the test requires 
the student to ask questions about the picture which cannot be answered 
directly from the picture. In the second part the student is asked to 
make guesses or formulate hypotheses about the possible causes of what is 
shown in the picture. The third part asks for consequences, both immediate 
and remote, of what is happening in the picture. The three Mother Goose 
picture prints used for this Task are identified as P| , ?£, and P^. 

2) The second set of teacher-controlled criterion tasks consisted 
of three written exercises designed for the study. These are referred to 
as the A, B, C, criterion tasks: The first. Task A- Lost . asks the student 
to list unusual ways he would use the things mentioned in the problem in 
order to feed and protect himself in a situation involving being lost in 
wild country. He is asked to think of ingenious uses for five things 
listed in the problem; such as, safety pin, ball of string, leather belt, 
etc. This task is similar to the unusual uses training task but is 
stated as an open-ended problem situation. The second. Task B- Rope 
Problem , is a modification of the Maier pendulum problem. The student is 
asked to think of different uses for a given number of things in order to 
solve the problem of tieing two ropes together that are not long enough 
for both to be reached. The third Task C- Rocket Ship Situation , asks the 
student to list consequences of things that would happen as a result of a 
given situation. 

All criterion tasks were reproduced in the same way as other school 
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mater ials-by ditto machine on a cheap white paper. The purpose of this 
was to lead the pupils to believe that they were undertaking a school 
assignment. They were administered by the regular classroom teacher 
(except in the case of the control group) and the teachers were asked to 
make no reference to the experiment. 



Admi ni strat i ve Procedures 

Both types of criterion tasks were administered before and after 
training and two weeks following training by each classroom teacher of the 
six experimental classes involved in the study. These tasks were adminis- 
tered by the experimenter to the three control group classes over the same 
time interval. The following complete instructions were used by each 
classroom teacher and the experimenter for administering the three parts 
of the Ask and Guess Test. 

General Directions (Flash picture on screen) 

There are many ways of finding out the things you want 
to know and need to know — the things you need to know in 
order to evaluate a situation and plan your strategies for 
coping with the situation. Our favorite ways of finding out 
things is by asking questibns, observing, and making guesses. 

To the extent that we can, we check these guesses against 
other things that we can observe or know from other sources. 

The three tasks in this booklet are based on a picture which 
you will be shown shortly. You will be able to look at the 
picture throughout the three tasks. 

1 . Ask and Gues s - 5 mi nutes 

This task will give you a chance to show how good you are 
at asking questions. In the spaces on this page, write down 
all of the questions you can think of about the things you 
see in the picture. Ask the questions you would need to know 
to understand what is happening. Do not ask questions which 
can be answered just by looking at the picture. Ask as many 
questions as you can think of. If you need more space, use 
the other side of this sheet. You will have five minutes to 
work. 
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2. Guess Causes - 5 minutes 

This task will give you a chance to demonstrate your 
ability to guess causes or formulate hypotheses concerning 
causations. In the spaces on this page, list as many pos- 
sible things as you can which might have caused the action 
shown in the picture. You may use things which happened 
immediately before the action in the picture, or something 
that happened a long time ago. Make as many guesses as 
you can. Do not be afraid to guess as no one can really 
know from the information which you have. If you need more 
space, use the back of this sheet. 

After five minutes, call time and go on to the third task. 

Guess Consequences. 

3. Guess Consequences - 5 minutes 

This task will give you a chance to show your skill in 
guessing the consequences of this action. In the spaces on 
this page, list as many possibilities as you can of what 
might happen as a result of what is happening in the picture. 

You may use things that might happen right afterwards or 
things that might happen as a result long afterwards in the 
future. Make as many guesses as you can. Do not hesitate 
to guess. No one can know for sure with the information which 
you have been given. 

After five minutes call time and collect booklets. 

The Ask-and-Guess Test was specifically timed for a total of 15 
minutes, allowing 5 minutes for each part. Instructions for administering 
and the time limits imposed for each part of the Test were derived from 
Yamamoto (1962). Test-retest reliability correlations for these picture 
tests reported by Yamamoto (1962, p. 70) range from .56 to .83. Torrance 
(1963, p. 5) reports test-retest reliabilities from .79 to .89 with a 
total of .85 for all three parts of the test. Wodtke (1963, p. 56) 
reports test-retest reliabilities for the Ask- and-Guess Test ranging from 
.49 to .74 with an over-all of .66 on subjects at the fifth grade level. 

All of the Aj B_, £ tasks were timed with an imposed limit of 5 minutes 
for each. The instructions each teacher used in administering these three 
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were as follows: 

These are exercises to see how well you can use your 
imaq i nation . Think of as many different ways as you can 
to complete the exercises in the tsme you have. 

Lost - Exercise A - mi nutes 

You are to list what you would do or how you would 
use the things mentioned in the situation in order to 
feed and protect yourself. 

Rope Problem - Exercise ^ mi nutes 

Think of as many different ways in which you could 
use the given things in order to tie the ropes together. 

Rocket Ship S i tuat i on - Exercise ^ - 5. minutes 

List as many interesting things as you can think of 
that would happen (consequences) if this rocked ship was 
used by many people. 

Samples of each of these three tasks are included in the Appendix A. 
Scoring 

All six criterion tasks for the experimental sample of S6 Ss were 
scored in the same procedure as the counterbalanced training word lists. 
Directions for scoring were prepared, and all Ss papers were coded, 
shuffled, and distributed among the same eight scorers used for the 
training tasks. The Ask- and- G uess Tests were scored for both originality 
and fluency. The same scoring criteria of selecting ten percent of the 
responses as original according to their infrequency within the sample 
population were adopted. The fluency score was the number of responses 
submitted by each S. A fluency score on these tasks was possible since 
each S was asked to think of as many responses as possible in the imposed 
time limit of 5 minutes for each part of the Ask-and-Guess test and for 
each of the A, B, C tasks. It should be mentioned, that all eight scorers 
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first viewed the Ask-and-Guess picture pertaining to the set of papers 
being scored. Sets of papers for each picture were scored separately. 

Procedures and Design 

Training 

The experimental training program was conducted in sixth grade 
classrooms located in two schools. The writer served as the experimenter 
who presented the training task to all six classrooms over fourteen 
consecutive days of training. The three classes which served as the non- 
re inf orced experimental groups which received practice only were trained 
first each morning between the hours of 9:15 and 10:00 in a consistent 
room order of 22, 23, and 24. The other three classes which comprised 
the reinforced experimental trained group were trained between the hours 
of 11:00 and 11:55, immediately following their morning recess period in 
a consistent room order of 10, 11, and 12. 

Design 

The experimental procedure of the study consisted of a counterbalanced 
design of both criterion tasks and training tasks across all six class- 
rooms of the two experimental groups. A similar counterbalanced design 
was adopted for the criterion tasks given to the control group. Figure 8.01 
is a representation of the counterbalanced design for the six criterion 
tasks for the three groups. 

Fourteen days of training for creative performance were conducted by 
an experimenter across both experimental trained groups. Training task 
word lists were counterbalanced on the 7th, 11th, and l4th days of training. 
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Legend: 



Task A - 
Task B - 
Task C - 
Task P^- 

Task P 2 “ 

Task P - 
3 



Lost Problem 
Maier Rope Problem 
Rocket Ship Situation 
Torrance Ask-and-Guess Picture 
Task (Old Mother Hubbard Picture) 
Torrance Ask-and-Guess Picture 
Task (Small Business Picture) 
Torrance Ask-and-Guess Picture 
Task (Cat in the Well Picture) 




Friday Friday- 3 wks Firday-5 wks 
Pre-Training later later 

Post- training Retention 

Figures'.B.Ol# Representation of counterbalanced experimental design 
for criterion tasks. 
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Eleven other training word lists were repeated across all six classes 
for each of the other training days. A representation of the counter- 
balanced experimental design for the training task is shown in Figure 8.02. 

Several weeks following the completion of training the experimenter 
again visited all six classrooms of the experimental trained groups. A 
form was distributed to each of the six classroom teachers asking them 
to choose their five most original and imaginative pupils. Their choice 
of the five were to be ranked from one to five as the most original child, 
next most original, and so on. Written instructions on the form were as 
fol lows: 



May we OLtain your identification of the five most 
original and imaginative pupils in your class. These are 
not the grade-getters nor the most logical or critical 
thinkers, but those most creative and different in their 
thinking. As you evaluate your class, rank your most five 
creative thinkers as: 

In this manner data on teachers choice of creative pupils was 
obtained so that a comparison could be made between those pupils recog- 
nized and selected by the teachers and those whose performance scores 
were the highest on the tasks selected to measure creative performance for 
the study. 



RESULTS 

If originality is a characteristic which is relatively stable from 
day to day in the amount elicited by a given task, then scoring procedures 
for the task could be tested for reliability. The reinforcement scoring 

of originality for each S on the even days of training (2nd, 4th 

l4th) was compared with the originality scores on the odd days (1st, 3rd 
13th) of training. Comparison of the odd and even days scores 



o 



8.38 



II 




Legend: WL - Training Word 

List 




Training Days (l4) 

RgUreii8.02. Representation of counterbalanced experimental design 
for training task. ^ 
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of each S in Room 10 yielded a correlation (reliability coefficient) of 
.673 (N=16). The reliabilities of odd and even day originality scores 
of each S in Rooms 11 and 12 were .743 (N=17) and .608 (N=15). The overall 
reliability coefficient of the reinforcement scoring for the reinforced 
classes (N=48) of the experimental sample was .615. These reliability 
coefficients, considering the difficulty of scoring a variable such as 
originality, do show a considerable degree of reliability. This would in 
turn indicate that the trait designated “originality'* as measured in this 
study appears as a relatively stable trait in a given S. 

Although it was shown that the scoring of the reinforcement group 
was relatively reliable, it remained to be shown that the magnitude of 
those reliability coefficients was not just a function of the particular 
scorer, but that they were a reflection of the fact that such a measuring 
device could be scored reliably. The responses on the word list papers 
of the first day of training for both experimental groups were scored for 
originality by eight scorers. The papers were randomly assigned to eight 
different sets and each set scored by two scorers. The correspondence of 
the scoring of these two scorers for individual items was ascertained by 
the use of contingency coefficients. The contingency coefficients obtained 
were .3I, .19, .26, .18, .11, .36 and .15 for the eight pairs of scorers, 
and .22 over-all. All were significantly different from zero (p<.001 for 
all except the contingency coefficient of . 1 1 which was significant at the 
.05 level). Although the contingency coefficients are low numerically, 
it must be remembered that the highest possible contingency coefficient 
is .707 for two categories and this accounts in part for their small 
numerical size (Guilford, 1956, p. 316). Further, these coefficients 
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represent the reliability of the scoring for each item. The scores used 
in the subsequent analysis, however, were based upon two scorers. 

Ten items per subject for each of the three counterbalanced training 
days were scored independently by each scorer. The number of responses 
selected as original by each scorer were summed to yield a total originality 
score for each S, and this score was used in the analyses of variance which 
follow. The scores obtained on each of the three days were summed to 
provide a total originality score that was used for the correlation 
analysis, A correlation of 0.48 was obtained by correlating the scores given 
to each case on the group of 30 items by one set of scorers and the scores 
given to each case on the same group of items by another group of scorers. 

The reliability of the two scores combined may be expected to be higher than 
this value and is estimated to be 0.69. The reliability of the originality 
score derived from the training task is also indicated by the fact that the 
score derived from the three counterbalanced days correlated 0,48 (N=96) 
with the originality score derived later from the Torrance Ask-and Guess 
test. 



The Analvs i s of the Data 

Analyses of variance were computed to determine the main effects of 
training, reinforcement, and transfer to the criterion tasks. A fixed 
model, 3x6 factorial design with repeated measures and unequal number 
of subjects per group was used in the analysis of variance of originality 
scores derived from training tasks. This same model was also used in 
the analyses of originality and fluency scores derived from the Ask- and - 
Guess and ABC criterion tasks for the experimental sample. A 3 x 3 




factorial design with repeated measures and unequal number of subjects 



per group was used in the analysis of fluency scores derived from criterion 
tasks for the control group. After each analysis of variance was completed 
the significant effects were studied using an individual degree of freedom 
test to determine which factors accounted for most of the variance. Each 
of these analyses with supporting data will be presented and discussed 
separately as follows: 

Analys i s of Variance for Counterbalanced Training Tasks of Both Experi - 
mental Groups 

An analysis was made of the scores on originality across counter- 
balanced training task word lists by days and by classes. It was hypothe- 
sized that the simple training procedure employed in the experiment along 
with daily reinforcement would tend to facilitate original responses on 
the training tasks. An analysis of variance was carried out to test this 
hypothesi s. 

Table 8.02 presents the means and standard deviations of the data 
used for this analysis. The summary table for the analysis of variance 
of the originality scores on counterbalanced training days for both 
experimental groups is found in Table 8.03. As can be seen from the 
table, there were no significant effects. Thus, there is no increase 
in originality across training days. The other expectation that 
reinforcing original responses would result in more originality as evidenced 
by this training task was also not supported. There were no significant 
differences between classes within schools, and neither was there a 
difference between the reinforced and the nonre i nf orced classes. 
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MEANS AND STANDARD DEVIATIONS FOR REINFORCED AND NONRE INFORCED 
EXPERIMENTALLY TRAINED Ss ON ORIGINALITY FOR 
THE THREE COUNTERBALANCED TRAINING DAYS 









7 


Trai ninq 


Days 
1 1 




14 






N 


X 


SD 


X 


SD 


X 


SD 


Xx 


Reinforced 

Classes 


















10 


16 


2.81 


2.37 


3.44 


2.48 


2.63 


1.96 


2.96 


11 


17 


3.35 


2.18 


1.71 


1.65 


2.47 


1.94 


2.51 


12 


15 


2.00 


1.65 


1 .60 


1.50 


3.20 


2.34 


2.27 


Total 


48 


2.75 




2.25 




2.75 




2.58 


Nonreinf orced 
Classes 


















22 


17 


2.06 


1.39 


2.24 


1.39 


2.06 


2.41 


2.12 


23 


13 


2.54 


2.14 


2.00 


2.00 


3.15 


2.51 


2.57 


24 


18 


2.61 


2.68 


3.06 


3.21 


2.83 


2. 18 


2.83 


Total 


48 


2.40 




2.50 




2.64 




2.51 
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TABLE 8.03 

ANALYSIS OF VARIANCE FOR COUNTERBALANCED TRAINING TASKS 
FOR THE TWO EXPERIMENTALLY TRAINED GROUPS 
SCORED FOR ORIGINALITY 



Source of Variation 


Sum of 
Squares 


df 


Mean 

Squares 


F 


P 


Between Subjects 




95 








Classes 


24.57 


5 


4.91 


-- 


- 


Schools (Reinf. vs. 
Nonre i nf orced 


.12 


1 


.12 


-- 


- 


Class/Schools 


24.45 


4 


6.11 


— 


- 


Subjects/Classes 


543.85 


90 


6.04 


-- 


- 


Within Subjects 




192 








Days 


6.97 


2 


3.49 


— 


- 


Class X Days 


54.52 


10 


5.45 


1.35 


- 


Res i dual -subjects 












Within groups 


727.52 


180 


4.04 


— 


- 



1357.43 287 



Total 



8.44 
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Analysis of variance for the Counterbalanced Ask-and - Guess Criterion Task 
of Both Experimental I y Tra i ned Groups Scored for Original i ty 

The reader will recall that the Torrance Ask-and-Guess Test was 
scored for both originality and fluency. Both of these abilities as 
factors of creative performance were analyzed separately since they 
yielded two distinct scores. Two analyses of variance were completed for 
testing differences between classes and days in terms of the Ask-and-Guess 
tasks which were presented before training, after training and two weeks 
following training. The hypothesis was that Ss would learn to respond with 
a greater number of original ideas as a result of training and such learn- 
ing would transfer from the training task to the criterion tasks. An 
analysis of variance was completed separately for originality and fluency 
scores to test this hypothesis. In Table 8,04 are the means and standard 
deviations for the originality scores obtained from the Ask-and-Guess 
Test. The summary table for the analysis of variance for these data is 
shown in Table 8.05. As can be seen from the table, class differences 
within schools were significant. This effect can be seen from Table 
8.06 to be largely a result of the fact that rooms 10 and 23 had sub- 
stantially higher scores than the other rooms. The reinforced group was 
not different from the nonrei nforced group classes. 

The effect uf days was not significant. This shows once again that 
training and/or reinforcement does not increase the amount of originality 
displayed by the Ss. It should be repeated that the across days comparison 
here involved a pre-training task, a post-training task, and .a retention 
task 9 iven two weeks after the post-training task in which it becomes 




evident that the amount of originality displayed by the groups as a whole 



TABLE 8.04 



MEANS AND STANDARD DEVIATIONS FOR REINFORCED AND NONRE INFORCED 
EXPERIMENTALLY TRAINED Ss ON THE ASK-AND-UUESS CRITERION 
TASK ADMINISTERED BEFORE, AFTER AND TWO WEEKS 
FOLLOWING TRAINING SCORED FOR ORIGINALITY 



Time of Administration 

Two Weeks 

Pre_Training Poj.t Training Fo1J.owing Training 
N X SD X SD X SD Xx 



Reinforced 

Classes 



10 


16 


6.88 


2.82 


1 1 


17 


6.24 


3.53 


12 


15 


6.40 


3.70 


Total 


48 


6.50 




Nonre i nf orced 
Classes 








22 


17 


6.06 


3.77 


23 


13 


8.15 


3.08 


24 


18 


7.44 


4.12 


Total 


48 


6.94 





9.94 


3.30 


9.50 


7.67 


8.77 


6.59 


1.79 


7.06 


3.03 


6.63 


6.00 


2.62 


5.93 


3.99 


6.11 


7.52 




7.51 







6.59 


4.86 


6.65 


5.30 


6.43 


9.15 


3.48 


9.38 


5.99 


8.89 


6.72 


3.94 


7.28 


2.63 


7.30 


7.33 




7.63 







TABLE 8.05 



ANALYSIS OF VARIANCE FOR COUNTERBALANCED DAYS AND CLASSES OF THE 
TWO EXPERIMENTALLY TRAINED GROUPS ON THE ASK-AND-GUESS 
CRITERION TASK SCORED FOR ORIGINALITY 



Source of Variation 


Sum of 
Squares 


df 


Mean 

Squares 


F 


P 


Between Subjects 




95 








Classes 


346.30 


5 


69.26 


2.91 


.05 


Schools (reinf. vs. 
Non reinforced) 


.35 


1 


.35 


— 


— 


Class/Schools 


345.95 


4 


86.49 


3.63 


.05 


Subjects/Classes 


2142.00 


90 


23.80 


— 


— 


Within Subjects 




192 








Days 


30.54 


2 


15.27 


1.16 


— 


Class/Days 


81.84 


10 


8.18 


— 


— 


Residual — subjects 












Within Groups 


2360.00 


180 


13.11 


— 


— 


Tota 1 


4960.68 


287 
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TABLE 8.06 

MEANS AND STANDARD DEVIATIONS FOR REINFORCED AND NONRE INFORCED 
EXPERIMENTALLY TRAINED Ss ON THE ASK-AND-GUESS CRITERION 
TASK ADMINISTERED BEFORE, AFTER AND TWO WEEKS FOLLOWING 
TRAINING SCORED FOR FLUENCY 











Time of 


Admi n i strat i on 








N 


Pre Training 
X SD 


Post 

X 


Training 

SD 


Two Weeks 

Fol 1 owi ng Training 
X SD Xx 


Reinforced 

Classes 


















10 


16 


30.81 


12.65 


46.06 


15.39 


33.56 


14.71 


36.81 


1 1 


17 


24.94 


9.27 


30.59 


12.91 


37.12 


19.63 


30.88 


12 


15 


28.07 


11.57 


34.67 


12.55 


40.33 


20.13 


34.36 


Total 


48 


27,87 




37.02 




36.94 






Nonreinforced 

Classes 


















22 


17 


20.71 


10.05 


24. 12 


11.32 


22.35 


15.04 


22.39 


23 


13 


27.08 


11.87 


37.23 


16.98 


35.92 


21.17 


33.41 


24 


18 


34.06 


16.18 


30.94 


11.12 


44.83 


12.43 


36.61 


Total 


48 


27.44 




30.23 




34.46 


• 


30.71 
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is remarkably stable. In fact, the indication is that originality may be 
a very stable trait which is little affected by direct training or rein- 
forcement, This does not exclude the possibility that some forms of 
indirect training may not be effective. 

Analysis of Var i ance for the Counterbalanced Ask-and - Guess Cr i ter i on 
Task of Both Experimental Groups Scored for fluency 

Table 8 .O 6 shows the means and standard deviations for this analysis. 
The summary Table 8.07 of the analysis of fluency for the Ask-and-Guess 
criterion test indicates the following: When the fluency scores of the 

two experimental groups (reinforced and nonre i nf orced classes) on the Ask~ 
and - Guess Task were compared, the main effects of classes (p<.05) and days 
(p<.00l) were significant as well as the interaction (p<.001). When the 
between classes effect was broken apart, no difference was found between 
the reinforced group and the non- re inf orced group. The variance related 
to this effect was quite widely distributed across the various classes 
and no generalizations could be made. 

In view of the findings derived from the other two analyses of 
variance that training did not affect the amount of originality displayed 
by the Ss; the above finding that fluency (total number of responses) did 
increase across days indicates a practice effect on the criterion tasks 
independent of training since fluency was not required on the training 
task. This suggests that originality may be a very stable trait which 
cannot be altered by this kind of training; and which does not display 
a practice effect. Even though Ss can benefit from practice and can learn 
to respond more fluently, the data indicate that they apparently cannot 
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TABLE 8.07 

ANALYSIS OF VARIANCE FOR COUNTERBALANCED DAYS AND CLASSES OF 
THE TWO EXPERIMENTALLY TRAINED GROUPS ON THE 
ASK-AND-GUESS CRITERION TASK 
SCORED FOR FLUENCY 



Source of Variation 


Sum of 
Squares 


df 


Mean 

Squares 


F 


P 


Between Subjects 




95 








Classes 


6819.8 


5 


1363.9 


3.05 


.05 


Schools (Reinf. vs. 
Non re i nforced 


250.3 


1 


250.3 




m* mm 


Classes/School s 


6569.5 


4 


1642.4 


3.68 


.05 


Subjects/Cl asses 


40211.7 


90 


336.8 


-- 




Within Subjects 




192 








Days 


3417.8 


2 


1708.9 


19.80 


.001 


Tasks 


1541.1 


2 


770.6 


8.65 


.001 


Interaction 


1735.5 


8 


217.1 


2.44 


.025 


Residual 












Within Subjects 


l604l.0 


180 


89.1 


-- 


-- 


Total 


69767. 9 


287 
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be trained to be more original. The significant interaction (P«=.025 
level of confidence) may suggest that something other than supposedly 
controlled experimental conditions was causing the interaction. There was 
a considerable amount of variation with respect to which of the three days 
produced the highest fluency for each room. For example, with Room 10 the 
post test produced the most responses, with Room 11 Ss the later test 
produced the most responses, and so on. This suggests that incorrect 
timing could be the cause since these were regular classroom exercises 
given by the regular teacher and integrated within the program of the 
school day. 

Anal ys i s of variance for the Counterbalanced ABC Criterion Tasks of Both 
Reinforced and Non reinforced Groups Scored for Fluency 

With the preceding evidence which revealed no significant differences 
in originality scores as shown by the analyses of variance for the 
counterbalanced training tasks and the Ask- and- Guess criterion task across 
days and classes it was hypothesized that it would be unlikely for any 
differences to occur on originality across the other teacher controlled 
ABC criterion tasks. In, order to relieve the scoring burden, these teacher 
controlled criterion tasks were only scored for fluency by a single 
scorer. An analysis of variance was completed in order to again test the 
effects of practice and transfer from an experimenter training task to a 
teacher controlled classroom task calling for a fluent number of responses. 
The means and standard deviations for these data are shown in Table 8,08, 
The analysis of variance for the data on the ABC criterion tasks for the 
experimental groups scored for fluency is presented in Table 8,09. 



TABLE 8.08 



MEANS AND STANDARD DEVIATIONS FOR REINFORCED AND NONRE INFORCED 
EXPERIMENTALLY TRAINED Ss ON THE “ABC" CRITERION TASKS 
ADMINISTERED BEFORE, AFTER AND TWO WEEKS 
FOLLOWING TRAINING SCORED FOR FLUENCY 







Time of 
Pre Training Post 


Admi ni strat i on 

Two Weeks 

Training Following Training 




N 


X 


SD 


X 


SD 


X 


SD 


Xx 


Reinforced 

Classes 


















10 


16 


4.50 


1.51 


8.25 


5.89 


11.56 


7.11 


8. 10 


1 1 


17 


10.29 


4.12 


4.59 


1.66 


7.88 


3.85 


7.59 


12 


15 


9.87 


3.50 


8.73 


2.60 


3.87 


0.52 


7.49 


Total 


48 


8.23 




7. 10 




7.85 




7.73 


Nonreinf orced 
Classes 


















22 


17 


8'.71 


4.37 


7.41 


4.54 


4.29 


1,65 


6,80 


23 


13 


3.92 


0.64 


11.08 3.71 


8.69 


2.92 


7.90 


24 


18 


6.00 


1.97 


3.83 


0.51 


10.28 


5. 14 


6.70 


Total 


48 


6.40 




7.06 




7.73 




7.06 



i 

r 





8.52 
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TABLE 8.09 

ANALYSIS OF VARIANCE FOR COUNTERBALANCED DAYS AND CLASSES 



OF THE 


TWO EXPERIMENTALLY 


TRAINED 


GROUPS ON 


THE 




"ABC 


" CRITERION TASKS 


SCORED FOR FLUENCY 






Source of Variation 


Sum of 
Squares 


df 


Mean 

Squares 


F 


P 


Between Subjects 




95 








Cl asses 


77.13 


5 




— 


— 


Schools (Reinf. vs. 
Non reinforced 


10.77 


1 


10.77 


— 


-- 


Classes/School s 


66.36 


4 


16.59 


-- 


-- 


Subject s/ Cl asses 


2027.70 


90 


22.53 


— 


— 


Within Subjects 




192 








Days 


16.84 


2 


8.k2 


— 


— 


Tasks 


1524.26 


2 


762.13 


8.66 


.001 


Interaction-Tasks x 
Days 


425.45 


8 


53.18 


6.04 


.001 


Res i dual — subjects 












Within Groups 


1584.00 


180 


8.8 


-- 


-- 


Total 


5655.38 


287 
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The effects of classes and days were not significant. There were no 
significant differences in the mean fluency scores on these tasks between 
classes or across the three testing occasions. The significant interaction 
(p<.001) may be the result of one task (B) being significantly more 
difficult than the other two. Table 8.10 presents means for all six 
classes arranged by tasks. Means by tasks indicate that task B was 
essentially twice as difficult as the other two. Since this task was 
counterbalanced across days with respect to classes, the fact that it was 
more difficult could produce the interaction effect. The Latin square 
used in the counterbalancing of tasks and days hardly permits the dis- 
covery of an interaction which has much meaning. 

Anal vs i s of yar i ance for the Counterbal anced Ask - and - Guess Cr i ter i on Task 
of the Control Group Scored for Fluency . 

A control group was used to measure any training effect that might be 
attributable to the taking of the tasks themselves. The Asj<- and- Guess 
criterion task was conducted over the same time interval for the control 
group as was the testing for experimental groups; that is, over one, three 
and five weeks time. Because fluency scores for the experimental groups 
were significantly different across the three days on the Ask-and - Guess 
task, the control group was scored for fluency on the same task to see if 
differences would be due to training. The question that remained to be 
answered was whether training affected fluency even though it didn't 
include a fluency task. An analysis of variance was completed to test 
if there were any significant differences between counterbalanced tasks 
on the three testing days. Table 8.11 gives the means and standard 
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TABLE 8,10 

MEANS FOR THE COUNTERBALANCED "ABC" CRITERION 
TASKS OF THE TWO EXPERIMENTALLY TRAINED 
GROUPS SCORED FOR FLUENCY 
ARRANGED BY TASKS 



Classes 


A 


Task 

B 


c 


10 


8.25 


4.50 


11.56 


11 


7.88 


4.59 


10.29 


12 


9.87 


3.87 


8.73 


22 


8.71 


4.29 


7.41 


23 


11.08 


3.29 


8.69 


24 


10.28 


3.83 


6.00 



8.55 



TABLE 8.11 

MEANS AND STANDARD DEVIATIONS FOR CONTROL GROUP Ss ON THE 
ASK-AND-GUESS CRITERION TASK ADMINISTERED OVER THE 
SAME TIME INTERVAL AS THE EXPERIMENTALLY 
TRAINED GROUPS SCORED FOR FLUENCY 



Time of Administration 

First Week JThird Week __Fifth Week 



Classes 


N 


X 


SD 


X 


SD 


X 


SD 


3 ^ 


1 


11 


26.55 


8.72 


22.82 


8.61 


34.82 


17.73 


28.06 


2 


15 


19.73 


7.31 


23.13 


8.04 


22.87 


11.51 


21.91 


3 . 


12 


33.83 


11.13 


30.67 


7.66 


36.42 


12.32 


33.64 


Totals 


38 


26. 16 




25.42 




30.61 




27.40 



8.56 
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deviations for the scores. As can be seen from Table 8.12, the effects of 
days and classes are significant (p<.001 and p<.05 respectively). Because 
of the IQ, differences across classes, the effect of classes was broken 
apart and it was found that the differences between the lowest mean IQ class 
and the highest mean IQ class was not significant (F=2.35, 4.13 needed for 
the P=.05 level of confidence). 

The effect over time (days) is obviously accounted for in terms of 
the difference between the first two days and the last. The gain suggests 
that a similar gain found in the case of the trained group is not a result 
of training but is probably due to some facilitation resulting from the 
taking of the tests. 

Studies Rel at i nq Measures of Original i tv and Fluency to 
Measures of Pupi 1 Performance 

Correlations of I ntel I igence with Original itv and Fluency 

Table 8.13 indicates correlations of The California Mental Maturity 
Scale with measures of originality and fluency scores on all tasks adminis- 
tered the two experimentally controlled groups. The correlations of this 
measure of intelligence with originality as measured by the training task, 
are shown in Table 8.13. Although with only one exception the correla- 
tions by rooms were not significant (r=.51, P<.05 for Room 24), the 
correlations by room were all positive and did indicate a trend for the 
two factors to be positively correlated. When the scores were not sep- 
arated by room but were considered altogether, the correlation was 
significant (r=.27, N=96, p<.05). One could conclude that there was a 
significant correlation between intelligence ( California Mental Maturity 
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TABLE 8. 12 

ANALYSIS OF VARIANCE FOR COUNTERBALANCED DAYS AND CLASSES 
OF THE CONTROL GROUP ON THE ASK- AND- GUESS 
CRITERION TASK SCORED FOR FLUENCY 





Sum of 




Mean 






Source of Variation 


Squares 


df 


Squares 


F 


P 



Between Subjects 
Classes 


2563.08 


2 


1281.54 


4.90 


.05 


Subjects within 
groups 


9193.00 


35 


262.70 


-- 




Within Subjects 
Days 


709.28 


2 


354.64 


12.64 


.001 


Interaction 


41.60 


4 


10.40 


-- 


— 


Residual — subjects 
Within Groups 


1964.00 


B. 


28.06 






Total 


5470.96 


113 









8.58 
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TABLE 8.13 

CORRELATIONS OF INTELLIGENCE WITH ORIGINALITY AND FLUENCY SCORES 
ON THE TRAINING AND CRITERION TASKS FOR THE TWO 
EXPERIMENTALLY TRAINED GROUPS 



Room 


N 


Or iqi nal i tv 

Training Ask- and- Guess 
Task Criterion Task 


Fluency 
Ask- and- Guess 
Criterion Task 


"ABC" 

Cr i ter i on 
Task 


Reinforced 














10 


16 


.30 




.29 


.18 


.38 


11 


17 


.45 




. 16 


.46 


.04 


12 


15 


.04 




.26 


• 

00 


.18 


Non re i nf orced 














22 


17 


.15 




.02 


.24 


.00 


23 


13 


.30 




.23 


.09 


.24 


24 


18 


.51* 




CM 

• 


.00 


.03 


Overal 1 


96 


.27* 




.28* 


.23* 


.13 


r's converted to 
Fisher's Z functions 
averaged and recon- 
verted to rv. 


9 











*p<.05 



8.59 
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Scale ) and originality as measured by this task for the two groups together. 

The correlations of Intelligence with originality on the Ask - and - 
Guess criterion task by rooms w^re positive (with one exception for Room 
22) and while Individually nonsignificant, were collectively significant. 

In order to obtain an over-all correlation for all classes It was necessary 
to combine them In such a way that class differences would not affect the 
relationship since It was suspected that some teachers may have allowed 
excessive time In administering the Ask-and - Guess Test . This was done by 
converting them to Fisher's Z function, averaging the values, and recon- 
verting them to an over-all correlation. This was done for all correlations 
on Individual classes. When this correlation was made the over-all 
correlation of originality measured by the Ask - and - Guess criterion task 
with Intelligence was .20 which was significant (p<. 05). 

The fluency scores on the Ask-and Guess criterion task correlated 
with Intelligence. These correlations varied from moderate positive to 
zero. Once again. In order to eliminate any class differences which may 
have resulted from differences In timing the Individual room correlations 
were converted to Fisher's Z function, averaged, and reconverted to an 
over-all correlation. An over-all significant correlation of .23 was 
obtained (p<.05). 

Fluency scores derived from the ABC criterion task were also cor- 
related with Intelligence test scores. For all rooms, these were all 
positive and low with the exception of one room (22) which was zero. The 
correlations for all rooms were again converted to Fisher's Z function, 
averaged and reconverted to an over-all correlation of .14 which was not 
significant. 
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An cver-all correlation of intelligence with fluency as measured by 
the Ask - and - Guess criterion task for the control group was .51. This 
would lead to the conclusion that this measure of fluency was correlated 
with intelligence (p<. 01). 

Cor re 1 at i ons of Training Task Originality Scores Cor re 1 ated wi ch Teacher 
Grades . C i t i zensh i p , Study Hab i t s and Grade P 1 acement on ^ Standardized 
Achievement Test 

Correlations between teacher academic grades, citizenship grades, 
study habits grades, standardized achievement test scores and originality 
and fluency scores on the counterbalanced Ask-and-Guess Test for each 
experimental group are presented in Table 8.14. 

Several points may be made concerning the correlations obtained 
from these data. It is apparent from the significant correlations of 
originality measures derived from scores on both the training tasks and 
the Ask-and-Guess task with the measures of pupil performance that these 
teachers do not discriminate against the original, imaginative child. 

In fact, the data indicate that there is a positive relationship between 
grades, citizenship and study habit marks given by teachers on the one hand, 
and the student’s originality and fluency performance on the measures used 
in the experiment. It is interesting to note the size of correlations 
(.31 and .37) obtained between mathematics, science and social studies 
grades with originality and fluency measures on the Ask-and-Guess task. 

In addition, the correlation of .375 was found between these subject 
matter grades and the training task originality scores. The fact that 
the correlation of originality with language arts, art and music grades is 
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even higher (.42) as measured by the Ask- and- Guess task indicates that these 
teachers recognize and rewarded creative students. The higher correlation 
of originality with language, art and music might be expected since these 
subjects, especially language arts, perhaps deal more with the same kinds 
of creative abilities as do the written tasks used in the experiment for 
measuring originality. One of the most significant findings is the size 
of the correlations of both originality and fluency as measured by the 
Ask-and-Guess tests with citizenship and study habit marks. These correla- 
tions certainly indicate that these teachers do not discriminate against 
students who measure high on originality and fluency. Another most impor- 
tant speculation is made when observing the significant correlations of 
originality (.42) as measured by the Ask-and-Guess criterion task with 
grade placement as determined by a standardized achievement test score. 

This correlation, along with those of originality with subject matter grades, 
could indicate that knowledge may be a key factor in the functioning of 
creative abilities. Original responses general ly requ ire a breadth of 
information and experience. At least it shows that the student who has 
sufficient knowledge to place at a higher grade level than where he is 
now also performs better on creativity measures involving original think- 
ing. This observation is even more interesting when observing the 
correlations of achievement test grade placement with training and 
criterion measures for originality (.42 and .31) as compared with fluency 
(.15). Apparently the actual knowledge of subject matter content has a 
higher relationship with original thinking than the ability to merely 
provide a stream of relevant responses regardless of their originality. 

The higher originality correlations across all of the measures of pupil 
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performance as compared with fluency correlations may indicate additional 
evidence to support this position. 

The last observation which should be pointed out when interpreting 
the resulting correlations in Table 8. l4 is that of individual room com- 
parisons. With the exception of a few scattered deviant correlations, 
most of the teachers were fairly consistent except one, Room 24. It 
should be noted that this teacher apparently places a different emphasis 
on citizenship and study habit marks than the other five teachers. 

Teacher 's Select i on of Original and Imaginative Students 
Related to the Ask - and - Guess Original ity Scores 

It should be recalled from the chapter discussing the experimental 
method and procedure that all six teachers of the experimental classes 
were asked to select and rank their five most original and imaginative 
students. The criterion for an original and imaginative student was 
explicitly given in written form to each teacher as "not necessarily 
the grade-getter nor the most logical, but the student most creative and 
different in his or her thinking." 

Originality score derived from the Ask-and - Guess criterion task 
of those students so selected by each classroom teacher was then com- 
pared by classes with originality scores derived from the same measure 
by the remaining students in the experimental sample not so selected by 
the teachers. Table 8.15 presents these data for each of the six class- 
rooms of the experimental sample. Only one classroom teacher's selection 
(room 10) of his most original and imaginative students approached 
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TABLE 8.15 



COMPARISON OF MEAN ORIGINALITY SCORES BY CLASS DERIVED FROM 
THE ASK-AND-GUESS CRITERION TEST FOR THOSE STUDENTS 
SELECTED BY EACH CLASSROOM TEACHER AS MOST 
ORIGINAL AND THE REMAINING STUDENTS 
NOT SELECTED 



orri 


Mearij 
Selected 
by Teacher 


Mean2 

Not 

Selected 


Differences 


t 


df 


10 


35.00 


23.42 


11.58 


2.05* 


15 


11 


17.25 


20.69 


- 3.44 


0.91 


16 


12 


20.50 


17.55 


2.95 


0.81 


14 


22 


24.33 


18.21 


6. 12 


0.98 


16 


23 


26.00 


26.90 


- 0.90 


0.13 


12 


24 


20.50 


21.71 


- 1.21 


0.33 


17 



significance (p<.05) when t tests were computed. All of the other five 
classroom teachers were unable to select their five most original 
students at any significant level of confidence as determined by the 
originality measure derived from the Ask-and-Guess test. These data 
may be interpreted to indicate that whatever the classroom teachers' 
criterion for an original student may be, it is app“^rently not the same 
as that which is being measured by the Ask-and - Guess Test. In essence, 
at least five of the six teachers were not selecting the same students 
on the basis of originality that the test measure did. This data may 
indicate that there is little agreement between teacher's judgment of 
originality and this particular test's measure of supposedly the same 
trait. Just why this disagreement occurs cannot be determined from 
these findings. 



Discuss i on 

This study was undertaken with considerable scepticism concerning 
the trainability of original responses. While the Maltzman studies 
had generally shown that practice in the generation of original res- 
ponses showed no transfer effect to performance on the Gui I ford Unusual 
Uses Test , one could argue that little transfer might have been produced 
owing to the very limited amount of training given. The hope of the 
present study was that with more extended practice a training effect 
might appear. In this respect, the data derived from the present study 
is unequivocal for there was not the slightest sign that training had 
any effect on the development of a trait of originality. 
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Perhaps the results of the present study are to be expected. First, 
it is very clear from an examination of the responses produced to the 
training task that the production of original responses is highly related 
to the amount of information possessed by the individual. A child who is 
asked to find an original use for a pencil and who suggests that the 
carbon might be used in building an atomic furnace is a highly knowledge- 
able youngster. The limit on original responses would appear to be the 
amount of stored knowledge of the person who is attempting to make the 
response. Practice in the making of original responses is hardly likely 
to increase the amount of knowledge stored. There is still the possi- 
bility that practice may somehow permit the individual to learn how to 
bring one stored item of information into a relationship with another 
stored item of information. Some comment on this possibility is in order. 

Most modern psychological theories take the position that a partic- 
ular item of information is not stored in a particular location in the 
brain, but rather that there is a redundancy of storage. Considerable 
physiological evidence supports such a position. The result of this is 
that there is a bui 1 t- i n- rel at i onsh ip between any item of information 
and a vast range of other items of information. Associationist theories 
generally assume that these relationships can be represented by a hierarchy 
of associations of different strength. Such a hierarchy of relationships 
involving even vastly different items of information is possible because 
of the high redundancy of the storage system. Associat ionistic theory 
also generally takes the position that the associations between items of 
information is a result either of relationships established through 
circumstances of learning or that the relationships may exist through the 
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accidents of storage which tend to bring one item of information into 
relationship to another. At no place in such a theory is the idea 
introduced that such associations can be developed by some kind of 
training which develops skill in locating one item of information in the 
brain and then finding another item with which it has not been associated 
and then, somehow, tying the two together. From the point of view of 
assoc i at i on i Stic theory , as wel 1 as any other aspect of learning theory, 
this notion is plain nonsense. The associations available are presumably 
built i nto the bra i n. 

The associat i on i St ic view does not imply that all associations bet- 
ween concepts or elements of information can be manifested at any time. 
Some of the weaker associations can function only on rare occasions 
or under exceptional circumstances. Hebb (19^9) has pointed out that 
accidental events, such as the simultaneous arrival of two nerve impulses 
at the same point at the same time, may trigger neural events which 
would not otherwise occur. Such accidental triggering would appear to 
play a role in the production of remote associations. The only exter- 
nally controllable condition which one can identify which mi ght f aci 1 i tate 
the production of such remote associations would be the inhibition of 
common associations. Thus the .notion that practice in the giving of 
remote associations will facilitate such an activity seems to be 
psychologically very unlikely. The educational theory that the basic 
elements of creative talent, other than the cogn i t i ve^ are trainable has 
emotional appeal but little scientific status. 

The correlations between the measures of originality and fluency and 
measures of educational performance are interesting. They give.no 



8.68 



II 



support to the reckless statements which have been made to the effect 
that teachers generally discriminate against the creative child. In 
only one class out of the six main experimental classes was there the 
slightest evidence that this was so. The child who scores high on a 
test of originality is likely to have higher than average grades and 
is likely to be better informed than the average child. In the class 
that is the exception in this study, the teacher involved was quite 
hostile to the study and was observed to have an unusually severe and 
authoritarian approach towards children. Our observations in schools 
lead us to believe that the teacher-child relationship in this particu- 
lar class was an unusual one. The evidence from the study is consistent 
with the position that the original child, who inevitably brings so many 
ideas of interest to a class, is generally favored by the teacher. 

Teachers, like children, enjoy elements that bring freshness to the school- 
room atmosphere. 

Of interest are the correlations between originality scores and 
achievement as measured by standardized tests. Such tests reflect to a 
considerable extent the amount of information stored by the student from 
the academic areas which the tests cover. The correlations give support 
to the position already voiced that a crucial element in the production 
of original responses is knowledge. The latter position seems to be as 
sound as any that has been taken in the area and leads to the implica- 
tion that one of the best ways of training the ability to produce 
original responses is to provide the pupil with a rich background of 
knowledge. It may well be that the acquisition of knowledge may have a 
far more powerful effect on originality than all of the training procedures 
which have been devised and which have such superficial attractiveness. 
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The first seven chapters in Section II form a series of closely 
related studies with findings which can be considered collectively. The 
eighth chapter presents a different kind of problem - the application of 
reinforcement to a category of behavior which is not likely to be strength- 
ened by such a procedure. While the reinforcement principle has powerful 
app 1 i cat i ons t o many problems of teaching, the last of the series of studies 
presented here was undertaken with the idea that certain behaviors are not 
facilitated or increased in frequency through reinforcement but must be 
developed by other procedures. 

The first series of studies all involve a rote learning task somewhat 
similar to that observed frequently in classrooms. The general format of 
the task remained the same throughout the various investigations. The 
acquisition task was that of learning the meaning of 60 German words during 
three periods which were always scheduled on Monday, Tuesday, and Wednesday 
of each week. The task was chosen because few children in the grades 
studied had been exposed to German and, hence, previous learning on the 
part of some children was not likely to be a factor contaminating some 
of the data. The general form of the task was that of presenting a German 
word together with two English words. The task of the subject was to 
determine which one of the English words had a meaning equivalent to that 
of the German word. The task could be presented under a number of different 
conditions and the studies involved a determination of the relationship 
of the learning conditions to the degree to which the pupils learned the 
task. The level of acquisition was always measured on Friday, but in one 
study a delayed retention test was also administered some weeks later. 

The first study in the series, presented in Chapter II, investigated 
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the acquisition of the German vocabulary under conditions similar to those 
which commonly occur in classrooms. The situation involved eight pupils 
sitting in a row and an experimenter-teacher who presented the German 
words and the two English words on large cards. The teacher interacted 
with some of the pupils but not with others. When the pupil chose the 
correct alternative, he was told by the experimenter that he was right. 

When he chose the wrong alternative, the teacher said nothing and turned to 
another pupil with the next problem. This plan of reinforcement was selected 
because it had been observed previously that teachers typically say nothing 
when the response of the pupil is incorrect. In addition, some pupils 
learned the task from a teaching machine. Under one of the teaching machine 
conditions the pupil worked alone. In the other the pupil and the experimenter 
sat side by side and the experimenter said either "right" or a synonym 
for right when the pupil made a correct response. 

The rate of acquisition showed a typical learning curve with the 
strength of the acquired response being related to the number of trials with 
the particular item, in this study, those who had direct interaction with 
the teacher in the familar reci tat ion- type of learning situation were those 
who learned most effectively (on the items on which interaction occurred.) 
However, when some interaction was provided in a teaching machine type of 
learning situation a similar advantage was not observed. Indeed, when 
the feedback was provided both by the experimenter and by the machine a 
particularly inefficient learning situation was observed to exist. 

Those who learned by observing the other pupils did not perform as 
well on the achievement test as those that learned through a situation in 
which there was interaction with the teacher in a group situation. Their 
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these items which the interacting group had to learn by observation. This 
suggests that the advantage gained by the interacting group is confined to 
those items on which there is interaction with the teacher. In the inter- 
acting group there was no general raising of the level of the performance. 

The first study brings out clearly that, among the conditions studied, 
the pupil who responds orally and who interacts with the teacher learns 
best, and the pupil who plays a strictly passive role learns least. 

The second study of the series reported in Chapter III explores the 
effect of the form of the teacher's feedback on the achievement of the 
pupils in a recitation rote- learning situation of the kind used throughout 
the studies. In this study the form of the feedback was varied and also 
the amount and redundancy of the information transmitted. The general 
finding of this study is that the more complex the feedback and the less 
the use of the feedback is dependent upon the memory of the learner the 
more effective it is. The most effective feedback was that which repeated 
the essentials of the problem together with the right answer in the form 
"(German word) means (English word.)" This form of feedback involved 
redundancy and also eliminated the need for the learner to remember or 
review the problem for himself. In this second study, the form of feed- 
back which was most efficient for those pupils interacting with the teacher, 
was also most efficient for those learning by observation. The data 
suggest that whatever conditions make for effective learning of subjects^ 
who are both orally responding and interact ing^ are also the conditions 
which make for the most effective learning of those who learn by observation. 
Another significant finding of this study is that the subjects who interacted 



with the teacher showed less decline in learning over days than those who 
did not interact. This finding has come out in a number of places in our 
data and, in general, those who have learned alone without social interaction 
have shown substantially more decline over the three days than have the 
children working in situations where a greater amount of social interaction 
occured. This study, unlike the previous study produced some evidence to 
indicate that a pupil who responded on one item not only learned that item 
more effectively than did the pupil who did not respond, but also tended 
to learn more effectively the other items on which he did not respond. 

The experiment on the effectiveness of different kinds of feedback 
also provided data pertaining to the relationship of the judged attention 
of the student to the amount of learning taking place. The judgments placed 
heavy emphasis on the direction of the gaze and the general appropriateness 
of the body orientation. These aspects of behavior were found to be 
relevant for judging the extent to which pupils were learning in those 
situations in which the orientation of the subject was necessary for the 
acquisition of information.. For example, if the subject had to read the 
words on a card in order to learn the task, then the orientation of the 
subject towards the reading material is clearly a necessary condition for 
learning. However, under those conditions where all the information 
necessary for learning is transmitted through the auditory channel, 
then attention as it is judged in this study and as it is ordinarily 
judged is unimportant for acquisition of the material. 

Chapters V and VI presejnt studies in which pupils work with pupils 
and in which the responses of pupils are reinforced by other pupils rather 
than by teachers. This approach to teaching is an old one though there 
have been almost no data available concerning the effectiveness of learning 
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situations in which pupils teach pupils. In the study reported in 
Chapter V, elementary school pupils in grades 4, 5, and 6, learned under 
a number of conditions in which pupils assumed both the role of teacher 
and the role of pupil. In summarizing the results of this study and to 
prevent any unwarranted overgeneralization, the point must be made that 
the materials were carefully developed so that one pupil could easily 
fall into the role of teacher and the other could readily assume the role 
of pupil. Unless materials have been very carefully prepared and are such 
that they provide considerable structure for the behavior of the children 
involved, effective learning is unlikely to take place. 

Among the various conditions of learning investigated in this research, 
the least effective was that in which the pupil served as teacher. However, 
the teacher-pupils did learn substantial amounts despite the fact that 
their role does not provide optimum conditions for learning. On total 
learning scores, the pupils who responded to each item and who were 
reinforced by the teacher pupil obtained the highest average scores. The 
second highest average scores were obtained by pupils who worked through 
the deck of cards alone. The latter pupils also obtained average scores 
comparable to the pupils in a previous study who learned the same task by 
means of a teaching machine. Switching the pupil and the teacher role 
in the middle of the task did not have the effect of vitalizing learning 
in the manner that had been anticipated. Those who switch roles obtained 
mean scores almost exactly between those who acted throughout the entire 
session in a teacher role and those who performed throughout in a pupil 
role. 

A delayed test measuring retention was administered at varying 
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intervals after the time of acquisition. The most striking feature of the 
data on delay is the remarkably low loss over the 9 week period. This may 
be due to the fact that the task was unrelated to school learning and, 
hence, new material learned in school during the period would be unlikely 
to have a disruptive effect on the learning undertaken as a part of the 
experiment. The data fit well an interference theory of forgetting in 
that such a theory would predict that in this case little forgetting would 
Occur. 

An efficiency score was computed for each learning condition. This 
score was the time taken divided by the number of items answered correctly 
and thus could be considered as a measure of time per item learned. The 
group which had by far the highest efficiency scores were those who worked 
alone since they required only about half of the time utilized by other 
subjects in order to complete the tasks. However, the advantage of the 
isolated conditions is probably offset by the fact that this group showed 
a more rapid decline in performance over the three days than did any of 
the other groups. Social interaction appears to have the advantage of 
maintaining acquisition behavior at a relatively hi^h level. While the 
performance of pupils on the achievement test was comparable at all three 
grade levels, the efficiency scores did improve with age with the older 
children requiring less time than the younger for each item answered 
correct ly. 

Chapter VI is a follow-up of the study which has just been considered, 
and involved an extension of the work to the 7th and 8th grades. In addition, 
it sought to determine the effect of matching or mismatching the pupils in 
the pairs in terms of previous academic achievement in solid subjects. 



The 
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unexpected finding appeared that the pupils who were mismatched learned the 
task significantly better than those that were matched. The observations 
of the experimenters suggest that this finding may be a result of the 
mismatched pupils interacting more with one another while working in 
pairs than did the pupils in the matched pairs. At the seventh and eighth 
grade the efficiency scores of the pupils were no greater than at the lower 
grade levels. Pupils at the junior high school level may well have prolonged 
the task in order to remain away from their regular rooms. In this study 
as in the previous one, the pupils working on the task alone showed a greater 
decrement over days than those working in pairs which further substantiates 
the position that social interaction has high value for maintaining behavior. 

The data across the five grades from the 4th to the 8th shows no 
difference between the groups as performance was measured at the end of 
each week. The difference lies only in the speed with which the pupils 
are able to master the task. Since this conclusion was derived from a rote 
learning task there should be no generalization to problem“Solving tasks. 

The latter point is made since some psychologists have recently taken the 
rather reckless position that all mental operations can be performed at 
all age levels. 

Chapter VII investigates the problem of the extent to which those 
learning by observation, the vicarious learning group, are able to use 
negative and positive information. The nature of the task was such that 
an error on the part of the person overtly responding provided exactly as 
much information as a correct response. The expectation was that those 
learning vicariously, without overtly responding, would learn more from the 
correct responses of the responding subjects than from the wrong responses. 
The data, taken as a whole, do not give support to this hypothesis. Indeed, 
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no consistent trend in the data over the two studies could be found. 

Chapter VIII considers a different problem from that which was con- 
sidered in the earlier chapters in this section. While the data of the 
earlier chapters generally supports the applicability of the reinforcement 
model to I earn Inc simp|^e materials in the classroom, the model would appear 
to have limitations. In this last chapter, a study was undertaken of 
the reinforcement of a class of responses which were such that one would 
not expect the reinforcement model to apply. Several such categories can 
be identified. One category of such responses would be fast and accurate 
responses such as must be made by the musician. Generations of music 
teachers have discovered that speed of performance is not achieved by 
reinforcing speed. On the contrary, speed is acquired through slow and 
accurate practice of the responses to be performed later rapidly. Speed comes 
automatically when the associations between the performance of successive 
notes and chords have been strengthened to the point where flawless accuracy 
of performance is achieved. 

The category of responses selected for reinforcement In Chapter VIII 
were unusual responses of the kind that are manifested on the Guilford 
Unusual Uses Test . Although many contemporary psychologists and educators 
have assumed that the development of creative behavior requires only that 
the teacher reinforce this behavior and make it in some way rewarding, 
such a theory lacks plausibility. First, a careful examination of previous 
studies indicates that any such effect previously found may be nothing more 
than an experimental artifact. Second, the ability to produce unusual 
responses would appear to be highly dependent upon the individual's 
cognitive structures from which they are generated. Perhaps the problem 
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of facilitating the generation of unusual responses is not so much a 
matter of arranging for the pupil to want to make such responses as it is 
a matter of providing him with the necessary internally stored knowledge 
from which such responses are derived. One disturbing possible source of 
artificial results in previous experiments is that the same person was 
generally used as both trainer and also the administrator of the criterion 
tests. It seems reasonable to suppose that training may have consisted of 
nothing more than the student discovering what the trainer wanted when 
he asked for an unusual use of an object or an original response. 

The study undertaken on the reinforcement of unusual responses was 
planned so that the devices used to determine whether reinforcement had 
produced an increase in the ability to produce original responses were 
adminstered by the teacher as a part of the regular class routine. The 
training tasks were administered by the experimenter and were understood 
by the children to be part of a study. The training task required the 
subjects to think of unusal uses of common objects - ten objects a day for 
15 days. The main evaluation tasks were derived from the work of Torrance 
and involved the interpretation of colored slides. 

No training effect was found, which is exactly what was expected. This 
result raises doubts about the validity of the work of other research workers 
who have not controlled some of the conditions in which an apparent training 
effect appears as an artifact. 

Substantial correlations were found between the measures of creativity 
and measures of achievement. These correlations are hardly surprising. The 
child who obtains a high score on any of the tests of creativity included 
in the study is likely to display on the same test a high level of knowledge. 
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Sndeed, the development of the child's knowledge would appear to be the key 
to the development of creativity as it was measured by the tests. Reinforce- 
ment would not appear to be the key to the problem in terms of the data 
which are presented in this report. 

3n addition, very little evidence could be found for the belief of 
some psychologists that teachers discriminate against the creative child. 

On the contrary, the data came nearer to supporting the position that teachers 
favor such children and tend to give them high grades. 



SECTION III 



STUDIES COMPARING DIFFERENT REINFORCING 



CONTINGENCIES IN CHILDREN 



Introductory Statement to Section III 



If the reinforcement paradigm can be applied to the planning of 
human learning situations, then it becomes very important to know the 
reinforcing value of different stimuli. While there are many prejudices 
current in education concerning this matter, knowledge is still meager. 
Typical among the widely held views of persons connected with education 
is the view that human beings are particularly effective reinforcers of 
human behavior. The theory underlying this point of view is that child- 
ren seek approval and that hence approval can be used to control human 
learning. If the young human is an approval-seeking organism then 
approval might well be expected to have a high reinforcing value. Such 
a conception of the events thi t are most reinforcing for human behavior 
has very important implications for the planning of education. Insofar 
as the position is sound, one would expect that direct approval by a 
teacher would, then, be more effective than the very indirect approval 
provided by a teaching machine. The importance of this problefn has 
led the staff of the project to give consideration to the study of 
approval as a reinforcer in contrast to the reinforcement provided by 
physical events which are not directly symbolic of approval. The first 
chapter in this section of the report represents a study which attempts 
to compare approval with a physical reinforcing contingency which is 
not directly approval-related. 

A second view commonly expressed in educational circles is that 
any unpleasant condition is likely to depress learning. This view which 
has become identified with the progressive education movement repre- 
sented a reaction against the somewhat free use of aversive stimuli in 
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in many schools of the past. The arguments against the use of aversive 
stimuli are many and varied. While one is that aversive stimuli depress 
learning, other arguments against its use have also been advanced by 
educators on moral and ethical grounds. The position advanced by such 
arguments has been supported by psychologists, but on different grounds. 
Psychologists have commonly advanced as an argument against the use of 
aversive stimuli the fact that such stimuli elicit anxiety arid 
that anxiety thus may become conditioned to a whole range of common 
stimuli. An additional argument is that there is also generalization of 
the anxiety response to stimuli which are similar along some dimension 
to that which produced the anxiety in the first place. However, such 
arguments are generalizations from studies involving subhuman organisms 
which have been exposed to very severe shock. The fact is that relatively 
little is known concerning the effect of less intense aversive stimuli 
on human learning. For this reason two studies have been conducted which 
compare the effects of pleasant and noxious stimuli operating within 
the framework of the reinforcement paradigm. These studies are reported 
in the second chapter of this section. 
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SECTION III 



CHAPTER I 

EFFECTS OF DIFFERENT REINFORCERS: 
A COMPARISON ACROSS 
AGE LEVELS 
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This study provides a comparison of the relative effectiveness of 
two reinforcing conditions, referred to as verbal and physical, as they 
effect responses made by boys and girls to a knob" p res s i ng task. The 
hypotheses state that there are differences in the reinforcing value of 
the two reinforcing events which are related to the age and sex of the 
subjects . 

The two classes of reinforcing events considered in the study are 
approving statements made by the experimenter (E) , which are termed 
verbal reinforcements, and the delivery of a marble by the apparatus, 
termed physical reinforcement. 

While the study limits itself to the two specific reinforcing con- 
tingencies described, the problem is of interest because of a dearth of 
information concerning the relative reinforcing value of different 
events. Some of the problems related to the use of teaching machines 
cannot be solved until much more is known about the reinforcing value 
of different classes of events with different age groups. 

The study is concerned with two reinforcing events only. However, 
the ultimate concern of this and similar studies is to identify cl asses 
of events which have particular value for reinforcing human behavior. 

Some General Conditions Involved in Reinforcement 

Studies wi th Ch i 1 dren 

The concept of reinforcement implies a stimulus event which will 
maintain or increase the strength or frequency of a response. Most 
reinforcers that are effective in the shaping of human behavior are 
only very remotely related to primary reinforcers. These reinforcers 
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are called secondary reinforcements and they derive their power from the 
fact that each symbol that reinforces represents a class of situations 
which in themselves have highly reinforcing properties. Such symbols 
as are used as reinforcers may represent different classes of events 
for different persons. The child who solves a problem and is reinforced 
by the teacher's announcement of "good," may interpret this announcement 
as a sign of personal achievement or as a sign of having achieved status 
in the group. 

Money, grades, toys and marbles have served in previous studies as 
physical or material symbols which have reinforcing properties and which 
hence increase frequencies of response. Saying "good" or "that's good" 
are obviously successful as verbal symbols of approval in both verbal 
and motor tasks. 

Reinforcers are related to behavior by certain laws as are other 
aspects of the environment. Meehl (1950), for example, states that 
most reinforcers for operant behavior are trans-s i tuat ional . That is, 
if an object or event has proved successful in strengthening responses 
in an organism in one situation, it will strengthen all or most learnable 
responses in the same organism. The term operant behavior is used here 
merely to distinguish this type of learning or responding from that 
involved in classical conditioning. 

In terms of the trans-s i tuat ional law proposed by Meehl then, any 
of the preceding reinforcers should be relatively effective in bringing 
about an increase in response strength where a learnable response is 
employed. 

The criterion of the effectiveness of a reinforcement is a change 

s 
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in the strength of a response as a result of the reinforcing stimulus 
event. One method of quantifying the construct of strength and the one 
which is used in this study is through procuring a measure of the fre- 
quency of response. 

In order to measure and compare different reinforcers across age 
levels, the same type of device must be utilized with all age groups. 
The device must surmount the barrier imposed by differences in language 
and reading skills and must afford a basis for comparison of responses. 
Manipulative or mechanical apparatus have typically been selected to 
meet the imposed criteria. 

Terrell (1958), in comparing performance at two age levels, used 
a "greater than" concept task where the subject was to choose the 
larger or smaller of two similarly shaped objects, i.e., between two 
cubes, squares or triangles placed before him. 

Another commonly used device involves an apparatus where knob- 
pressing results in the delivery of one or more types of reinforcement. 
These devices are generally variations on a theme and differ with the 
design of the study and the desires of the experimenter. Jones and 
Liverant (i960), for example, designed the exterior of their two- knob 
device with a clown's face from whose mouth reinforcements could be 
dispensed. McCullers and Stevenson (i960) used a device which had 
three centra 1 ly- 1 ocated knobs, designated to deliver different kinds 
or combinations of kinds of reinforcements to which the subject could 
respond. A device of the latter type affords a basis for comparison of 
responses to different stimuli and can likewise be utilized across 
different age levels without an alteration of the task. 
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Incent i ve I terns 

j 

Studies of reinforcement in children commonly provide the child 
with a small gift which he knows, in advance, he is to receive if he 

I 

j: 

performs well on the task. For example, McCullers, ^ (i960) and | 

i : 

also Kuenne (1956) provided small toys which each child was told he 

i ^ 

was to receive and which were visible while the task was being performed. 

In some situations, such an over-all incentive item is unnecessary, as 

i ’ 

in the study of Alberts and Ehrenfruend (1956) in which gumdrops were i 

used as the reinforcing event. I 

|: 

The incentive items used are generally those towards which children 
of the particular age group used show a stong positive response. Such 
items themselves would almost certainly function as reinforcers, but 
their function in the particular setting is to maintain an orientation 
towards the task. Toys and candy have been well-established as incen- 
tive items. 

A range of suich incentive items from which the child chooses seems 
to be a good arrangement. Brackbill and Jack (1958) demonstrated that 
individual preferences rather than sex or age per se determine the choices 
of children for such rewards. When these investigators allowed five- 
year old boys to choose marbles or candy or trinkets, each one 
of the three was chosen an equal number of times. 

Results of Prev'i ous Studies 

McCul lers ^ aj,. (i960) utilized the knob-pressing apparatus pre- 
viously described to yield a comparison between subjects three to five 
years of age and persons eight to ten years of age. A marble reinforcement 
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was available to the subjects upon the depression of any one of the 
three knobs. Coupled with the latter physical reinforcement on one 
knob was a verbal comment denoting approval, such as, "That's good." 

The effectiveness of verbal reinforcement would be demonstrated by an 
increase in frequency of response to the knob delivering both physical 
and verbal reinforcement. Their hypothesis that the younger group would 
be reinforced to a greater degree by verbal reinforcements than would 
the older group was confirmed (£ = .01). The older group distributed 
their responses more uniformly across the differently reinforced res- 
ponses and a _t test revealed no significant increase in response rate 
to the knob that delivered a verbal reinforcement as well as the marble 
which all knobs delivered. 

Jones (i 960 ) investigated another factor, namely, the ability 

to discriminate among the percentages of rewards available. Their study 
supports the preceding findings of McCu 1 lers _aj[. (I 96 O) and offers 
supplementary information concerning the variability of response phe- 
nomena . Their subjects were aged four to six and nine to eleven. An 
apparatus with two knobs was employed. The percentage of reinforcement 
on one side exceeded that on the other. Two schedules were established. 

One schedule dispensed 90 per cent reinforcement on one side and 10 per 
cent on the other; the second allowed for 70 and 30 per cent reinforcement. 

It was hypothesized that the younger group would respond almost 
100 per cent to the side delivering the larger percentage of reinforce- 
ment while the older group would distribute their responses to both 
sides more nearly in accord with the percentages of reinforcement 
available. If the distribution of reinforcements to the two knobs were 
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reversed, the older children would be expected to discriminate the 
change sooner and respond accordingly. 

Both hypotheses were confirmed which Indicates that In children 
nine to eleven,, an ability to discriminate between percentages of 
available reinforcements affects the responses which they make to each 
possible choice. 

A study by Stevenson and Cruse (1961) compared responses made to 
a marble- Insert Ion task by five and twelve year olds under four condi- 
tions. In one condition, the experimenter was present and attentive 
to the subject but made no comments; In another, the experimenter was 
In the same room but was not In close proximity to the subject nor did 
he communicate with him; In a third Instance, comments denoting approval 
were delivered; a fourth condition Involved the delivery of critical 
comments by the experimenter. 

The findings showed that the twel ve-year-ol d group performed at a 
lower level than did the five-year old group under conditions of rein- 
forcing comments and attention from the experimenter. Stevenson^ al. 
( 1961 ) Interpreted these findings as indicating less dependence on 
adult approval among twelve-year olds than among five-year olds. 

Page (1958) utilized the comparative method In exploring the 
relative effectiveness of three methods of distributing grades and the 
reinforcing contingencies Involved. He employed the performance of 
two groups composed of males and females. .. one of junior high school 
age and the other of senior high school age. 

A letter grade Issued with no comment, a letter grade Issued with 
a specified teacher comment, and a letter grade issued with an appropriate 
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and spontaneous teacher comment (called a “free" teacher comment), 
comprised three reinforcing conditions which were being compared. 
Following the random assignment of these reinforcing conditions to the 
three experimental groups, the students received the grades which they 
had made on a specific test. On the next test administered to the 
groups, the test scores obtained by the students in the groups wherein 
grades had been distributed with specified and free comments were 
significantly higher (£<.0l) thah those obtained by the students in the 
no-comment group. A trend toward higher scores in the free comment 
group than in the specified comment group was shown but the difference 
was not statistically significant. Page states that the findings 
indicate a greater responsiveness to adult approval among the high 
school students than among the subjects of junior high age though the 
differences were not statistically significant. 

The studies of McCul lers e;L il.* (i960) and Stevenson ^ a_L (1961) 
demonstrate that at ages three to five, adult approval may be sought 
and will serve as an effective reinforcer; at ages nine to twelve, adult 
approval results in no greater frequency of response than do physical 
reinforcements and therefore is not considered a particularly effective 
reinforcer at this age level. 

Page's study (1958) indicates a tendency for adult approval to be 
more effective with subjects of high school age than with those of 
junior high age. Specific age levels were not defined by Page and in 
generalizing from his findings, some assumptions were necessarily made 
concerning the ages involved. 
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Interaction effects based on sex differences between the subjects 
and experimenters have been investigated at some age levels, particularly 
among pre-kindergarten and kindergarten students. The findings have 
been inconclusive but this dimension may have some relevance for this 
study. 

In a situation where the task was easel-paint ing, Gewi rtz (195^) 
obtained a measure of the degree of responsiveness to the experimenters 
through the frequency of attention-seeking behaviors and comments made 
by the subjects in the presence of the experimenter. With male subjects 
aged four to five and one-half, there was a significantly greater fre- 
quency of these responses made in the presence of the female experimenter 
than in the presence of the male experimenter. Female subjects likewise 
demonstrated a greater frequency of response in the presence of an 
experimenter of the opposite sex than with an experimenter of the same 
sex though the difference was not statistically significant. 

Gewi rtz (195^) postulated that the interaction he found might not 
obtain at other age levels and explained his findings in terms of Freud's 
theory regarding development during the phallic period and particularly 
at the Oedipal stage. In its simplest form, the Oedipus complex implies 
a positive drive toward the opposite- sex parent with accompanying feel- 
ings of hostility toward the parent of the same sex. It is theorized 
that boys maintain the mother or female as an object to be chosen over 
the father or male until the latter part of the fifth year; girls trans- 
fer their choice preference from the mother to the father around age 
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utilizing kindergarten subjects (ages five years, three months 
to five years, seven months), Rosenblith (1959) found more learning 
occurred with male experimenters than with female experimenters with 
both ma 1 e and f ema 1 e subject s . The d i f f e re nee was statisti ca 1 1 y s i gn i ~ 
ficant only between the female subjects and the male experimenter. In 
this study, the Porteus-maze task was learned by imitating the leader. 

The findings of previous studies support the following conclusions: 

1. On a simple knob-pressing task such as was used by McCullers 
e^aU (i 960 ) there is a tendency for approval to increase frequency 
of response in the three to five year olds but not in older children. 
However, older children respond to other features of the task which 
the younger do not --a point brought out by Jones et a 1 . (i960). Some 
further evidence of the effectiveness of adult approval in the case of 
the pre-school group was found in the Stevenson et a1 . (1961) study. 

The latter study used a marble insertion task rather than a knob- 
pressing task. 

2. Not all evidence suggests that approval is a poor reinforcer 
for children in the upper elementary grades. Page (1958) found evidence 
that a written comment of approval appeared to function as an efficient 
re inforcer for children, a little older, at the junior high school level. 
Perhaps a spoken comment of approval is less effective than a written 
comment because of the transitory nature of the spoken word. Approval 
may become increasingly effective through the teens, a matter requiring 
further study. 

3 . The findings suggest the existence of interactions between the 
sex of the experimenter and the sex of the subject in the effectiveness 
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of approval as a re inforcer. Gewirtz (195^) indicates that his data 
supports the contention that male subjects are more readily reinforced 
by an experimenter of the opposite sex at ages of four to five and a 
half. Girls aged five years, three months, to five years, seven months 
were also more readily reinforced by an pxper imenter of the opposite 
sex. 



The Present Study 

Previous studies have not provided a direct comparison of the 
reinforcing value of statements of approval in contrast to other kinds 
of events. The purpose of the present study is to determine the rela- 
tive reinforcing value of orally administered statements of approval 
in contrast with the reinforcing value of a physical event consisting 
of the delivery of a marble. In a sense, the delivery of the marble 
involves more than just the mechanical event, for the subject is also 
provided with the information that the delivery of a marble indicates, 
in some way, that his behavior is correct. In other studies where the 
effect of verbal ' approval has been introduced as a factor, such approval 
has occurred simultaneously with some other reinforcing event such as 
the delivery of a marble. 

The assessment of the reinforcing value of comments of approval 
in comparison with other mechanically administered reinforcing events 
provides knowledge of considerable importance for the design of class- 
room learning situations. Traditional teaching has depended largely 
on approval administered by the teacher as the main reinforcing event. 
The difficulties of administering such reinforcements on a sufficiently 
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frequent basis has been responsible for attempts to design machines which 
provide reinforcing contingencies. Such machines depend for their effi- 
ciency on some reinforcing event other than that of spoken approval. 

The reinforcing events provided by mechanical devices do indirectly 
symbolize approval of the teacher, much as the delivery of a marble in 
the present experiment symbolizes indirectly the approval of the experi- 
menter. 

The study involves the testing of the following hypotheses: 

1. Verbal reinforcement is more reinforcing than physical rein- 
forcement in pre-kindergarten children. 

2. Physical and verbal reinforcements are equal 1y effect ive for 
third grade children. 

3. A low level of dependence on adult approval is reflected in 
the sixth grade and junior high-school by the greater effec- 
tiveness of physical than verbal reinforcement. 

4. Verbal reinforcement is more effective at the pre-kindergarten 
than the third grade level. 

5. Approval is more reinforcing among females in kindergarten 
and third grade than among males at the same age levels. 

6. Teacher's rankings on need for approval correlate with respon- 
siveness to verbal reinforcement. 

Method 

Sub iects 

The _Ss were 100 males and 100 females selected on the basis of 
chronological age and sex. Twenty males and 20 females were selected 
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within each of the following five age ranges: (a) 3-11 to 5-8, (b) 

5“9 to 6-6, (c) 8-11 to 9“7» (d) 12-5 to 13“3» and (e) l4-8 to 

15“8, The age levels are representative of students in pre-kindergarten, 
kindergarten, and grades three, six and nine. The _Ss in the pre- 
kindergarten group were obtained from private nursery schools and the 
remainder from two public schools in metropolitan Salt Lake City.' 

Apparatus 

A device which delivered different kinds of reinforcement upon 
the depression of each of four knobs was selected as a task which could 
be used at the various age levels (see Fig. 1.01). The turquoise panel 
which fronted the apparatus was a square, 23 1/2 inches across. Centered 
8 1/4 inches from the top edge was a round, red light which served as 
a signal to _S for another trial. The light was turned on from behind 
the panel and extinguished automatically following the depression of a 
knob. The aluminum knobs were centrally placed 7 1/2 inches below the 
red light. A glass-covered, enclosed box located 3 1/2 inches below 
the knobs served as a receptacle for the marbles which were dispensed 
upon the activation of a solenoid behind the panel. The contents of 
the receptacle were not accessible to ^ but entry of the marbles into 
the box was audible and the marbles were visible. 

Procedu re 

The _Ss were tested individually in an experimental room established 
in the school plant. The JE addressed each _S by his first name and gave 
these instructions: 

'The writers express appreciation to the faculties of the schools 
involved and to the students involved for their cooperation. 



Fig. 1.01 Arrangement of apparatus with experimenter at the left 
and the subject at right. 
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" this is a game. I can turn a light 

on like this (light on), and you can turn it off (depress 
knob). Now, you try it (light on). Now, try another one 
(light on). Try all of them. (Allow a total of four 
trials.) But not all of the knobs are the right knob. 
Sometimes I will tell you that you have pushed the right 
knob. I'll say 'good' or 'that's good.' Sometimes you'll 
get a marble here (pointing) when you push the right knob. 
If you push the right knob most of the time, you may choose 
one of these toys (point to display) when we're through. 
Now, how will you know if you've pushed the right knob? 
(Continue with this until they say 'get a marble' and 'you 
say good. ') " 



Practice was given under actual reinforcement conditions until _S 
had depressed each of the four knobs one time. Following this, ^ was 
given 120 trials. 

The conditions assigned to the four knob positions were marble 
only, verbal, marble and verbal simultaneously, and no reinforcement. 

To counterbalance knob position effects, the conditions were assigned 
across the knob positions so that each reinforcement was delivered from 
each knob position an equal number of times within each age group. 
Reinforcement was delivered on 50 per cent of the trials and was assigned 
so that it occurred in random, rather than systematic order. 

Incentive items were provided for participation and were displayed 
approximately six feet from _S. A bag of candy, a toy suitable for a 
male, one suitable for a female, and another which might be chosen by 
either sex constituted the four- item display. Objects which were appro- 
priate to the age of the _S, were selected at the four age levels. The 
objects were held constant within each age group. 
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Resu 1 ts 

In order to function as reinforcers, the reinforcing stimulus 
events must maintain or increase the frequency of the response upon 
which it is contingent. Figure 1.02 shows the mean number of responses 
made to the various stimuli by blocks of twenty trials for the total 
sample. 

The responses made at various age levels to the four treatments are 
presented for comparative purposes in Figure 1.03. Figures 1.02 and 
1.03 demonstrate: 

1. Response in the absence of the two re inforcers provided does 
not extinguish even for this relatively long task. 

2. Response in the absence of the two reinforcers provided 
decreases as a function of age with the greatest frequency of response 
in the pre-kindergarten group and the least in the junior-high group. 

3. In the total sample, the frequency of response in the presence 
of approval reinforcement decreases in magnitude in the last block of 
twenty trials from the frequency observed in the first block of twenty 
trials. The difference is not statistically significant (t = 0.7^). 

4. A decrease in response to approval reinforcement across the 
blocks of trials is noted in the kindergarten and third grade group; 

in the sixth grade and junior-high groups, the frequency of response is 
maintained at approximately the same level when comparing the first 
block of trials with the last; in the pre- ki ndergarten group, an increase 
in response is noted, 

5. The mean number of responses in the presence of marble-verbal 
and marble reinforcement are typically greater in magnitude than are 



Mean number of responses 
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Number of trials 

Figure 1.02 Mean number of responses for all male and female 
subjects for each block of 20 trials. 
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Figure 1.03a Mean number of responses for male and female subjects 
in the Prekindergarten group. 
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Number of trials 

Figure 1.03b Mean number of responses for male and female subjects 
in the Kindergarten group. 
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Number of trials 

Figure n«0 3c Mean number of responses for male and female subjects 
at age level 8-11 to 9"7 (third grade group). 




Number of trials 



Figure. 1 . 03 d Mean number of responses for male and female subjects 
at age level 12-5 to 13 " 3 . (sixth grade group). 
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Figure l,03e Mean number of responses for male and female subjects 
at age level l4-8 to 15"8. (ninth grade group). 
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the mean number of responses made in the presence of verbal stimuli. 

The only exception is found in the pre-kindergarten group, 

6. The greatest amount of learning is shown by Ss in the two 
oldest groups where fewer responses occur in the absence of reinforce- 
ment. 

The learning curves of the five groups provide very clear evidence 
that physical reinforcement is much more effective than the verbal. 

Table 1.02 provides an F-test to determine the significance of the differ- 
ence between reinforcing contingencies. The difference between the 
effects of the reinforcing conditions have a very irigh degree of confi- 
dence attached to them, it should be noted here that there are diffi- 
culties involved in undertaking analyses of variance on the complete 
data since restrictions are introduced by the fact that the total number 
of responses is always 120. This restriction becomes less troublesome 
when the data from the two main reinforcing conditions are involved as 
is the case in subsequent analyses of variance. 

The basic data on which the following analyses of variance are based 
is given in Table 1,01, Table 1.03 presents a 2 x 2 x 5 analysis of variance 
involving the two major reinforcing conditions, two sexes, and five age 
groups. A highly significant main effect related to reinforcing condi- 
tions is evident. An interaction between reinforcing conditions and sex 
is significant at almost exactly the five percent level. This latter 
effect appears to be mainly due to two factors: (a) the responses of 

the pre-kindergarten children to the task and (b) the responses of 
males to physical reinforcement. The differences between boys and girls 
at the pre-kindergarten level are shown in the learning curves presented 
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Table 1.01 



FREQUENCY OF RESPONSE TO EACH STIMULUS 



FOR 


20 BOYS 


AND 


20 GSRLS 


AT EACH AGE 


LEVEL 








Verbal 




Marble 




Marble 

Verbal 




Nothing 




Pre-Ki ndergarten 


















Male 


546 




653 




627 




574 




Female 


621 




603 




633 




543 




Sub Total 




1 167 




1256 




1260 




1117 


Kindergarten 


















Male 


522 




708 




626 




544 




Female 


602 




651 




623 




524 




Sub Total 




1124 




1359 




1249 




1068 


Third Grade 


















Male 


563 




684 




662 




491 




Female 


536 




653 




689 




522 




Sub Total 




1099 




1337 




1351 




1013 


Sixth Grade 


















Male 


548 




638 




709 




505 




Female 


605 




647 




675 




473 




Sub Total 




1153 




1285 




1384 




978 


Ninth Grade 


















Male 


548 




656 




703 




493 




Female 


579 




632 




681 




508 




Sub Total 




1 127 




1288 




1384 




1001 


Totals 


















Male 


2727 




3339 




3327 




2607 




Female 


2943 




3186 




3301 




2570 




Totals both sexes 




5670 




6525 




6628 




5177 
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Table 1.02 

ANALYSBS OF VARIANCE FOR TOTAL SAMPLE 





df 


Ms 


F 


P 


Between Reinforcing 
Cond i t i ons 


3 


2427.06 


42.78* 


<.001 


Within 


796 


56.73 






Total 


799 








'‘Significant beyond .001 


1 eve 1 . 









Table 1.03 

ANALYSIS OF VARIANCE OF VERBAL AND 
PHYSICAL REINFORCING CONDITIONS 





df 


Ms 


F 


P 


Reinforcement 


1 


1827.56 


34.68 


<.001 


Ages 


4 


8.68 






Sex 


1 


9.925 






R X A 


4 


52.91 






R X S 


1 


340.41 


6.46 


.01 


A X S 


4 


25.40 






R X A X S 


4 


39.08 






Within 


380 


52.69 







Total 



399 
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in Figure 1.04. The girls show a clearly greater responsiveness to ver- 
bal reinforcement than do the boys. Males respond with higher frequency 
to physical than to verbal reinforcement at all age levels. 

A 2 X 3 X 5 analysis of variance (Table 1.04) was performed includ- 
ing three of the reinforcing condit ions. . .verbal , physical ^ and physical- 
verbal. There was, again, a difference among the conditions of reinforce- 
ment significant beyond the .01 level. A reinforcement by sex interaction 
was significant at approximately the .05 level (F obtained = 2.89; JF 
required = 2.99). Scheffe's test among the means of the reinforcing 
conditions in this analysis demonstrated that physical reinforcement 
alone was significantly more effective than verbal alone. A comparison 
of the mean of the physical reinforcement with the mean of the verbal - 
physical combination was also performed in an effort to determine the 
effect of providing verbal and physical reinforcement simultaneously. 

No significant difference between the two indicates that, statistically, 
physical reinforcement alone is as effective as physical reinforcement 
when it is offered simultaneously with verbal reinforcement. 

At the lower age levels significant differences were found between 
the number of responses made by males and females to the knob reinforced 
with approval. The differences in the mean number of responses for 
males and females for the pre-kindergarten and kindergarten age group 
are shown in Table 1.05. Consistent with previous work, the signifi- 
cant differences are at the kindergarten and pre-kindergarten levels. 

The learning curves from which these differences are derived are 
shown in Figure 1.04. 
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Mean no. of responses Mean no. of responses 



1.25 




Number of trials 

Figure' 1*o4a Total mean responses of male subjects (N=20) . Pre- 
kindergarten group. 




Number of trials 

Figure’ l,04b Total mean responses of female subjects (N=20) . Pre- 
kindergarten group. 
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Figure ! 1 ..04c Mean responses of male subjects in the Kindergarten 
group . 




Number of trials 



Figure K04d Mean responses of female subjects in the Kindergarten 
group. 
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Table 1.04 

ANALYSIS OF VARIANCE OF VERBAL, PHYSICAL AND 
PHYSICAL- VERBAL REINFORCING CONDITIONS 





df 


Ms 


F 


P 


Reinforcement 


2 


1382.83 


22.58* 


<.001 


Ages 


4 


26.46 






Sex 


1 


2.28 






R X A 


8 


72.68 


1.19 




R X S 


2 


175.72 


2.87’’-v 


.05 


A X S 


4 


6.87 






R X A X S 


8 


35.93 






Within 


570 


61.24 


- 




Total 


599 









*Sign if leant beyond .001 level. 
'wVNeed 2.99 at .05 level. 






The latter finding does not necessarily imply that responses to the 
verbally reinforced knob are more frequent at the lower than at the 
higher age levels. In actual fact there is no significant difference 
between the mean number of responses to the verbally reinforced knob of 
the pre-kindergarten and the third grade group (t = 1.33). However, when 
the same comparison is made using only females, a difference is found 
which is significant at the .01 level of confidence, as is shown in 
Table 1 . 06 * 



Rated Need for Approval of Pupi 1 s and Responsiveness 

to Verbal Reinforcement 



The teachers of the school children included in the study were 
asked to rank the children in their classes for their need for approval. 
The directions given to the teachers in making the rankings were as 
fol 1 ows: 



“Please rank the individuals below according to the 

way they respond to approval. There are names included 

in the list. Number 1 will signify the individual whom 
you feel is most responsive to approval or praise when com- 
pared with the other names on the list. The person whom 
you feel is next most responsive to approval will be number 

2, and so on down the list. Number will signify the 

individual among the group whom you feel is least responsive 
to praise. 



The rankings thus obtained were converted to normalized standard 
scores. All rankings were then pooled and a correlation computed between 
ranking and frequency of response to the knob reinforced with verbal 
approval. A Pearson £ of .005 indicates a zero overfall relationship 
between the two variables. However, such a pooling of data is not 
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Table 1,05 

MEAN FREQUENCY OF RESPONSE OF MALES AND FEMALES TO THE 
VERBALLY REINFORCED KNOB AT ALL AGE LEVELS 



Age Group 


Mean Frequency 
of Response 

Male Female 


t 


Pre-Kindergarten 


27.3 


31.05 


2.03* 


Kindergarten 


26.1 


30.1 


2.78 


Third Grade 


28.15 


26.8 


1.07 


Sixth Grade 


27.4 


30.25 


1.79 


Ninth Grade 


27.4 


28.95 


1.15 


’‘Requires t of 2 


.086 at .05 level. 








Table 1. 


06 




COMPARISON OF FREQUENCY OF RESPONSE TO VERBALLY REINFORCED 




KNOB 


FOR PRE- KINDERGARTEN AND THIRD GRADE CHILDREN 






Pre-Ki ndergarten 


Third Grade 






Mean 


Mean 


t 


Female 


31.05 


26.8 


2.84* 


Male 


27.3 


28.15 


0.43 



*p = .01. 
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entirely ressonable for pupils were drawn all the way from kindergarten 
to the ninth grade. 

Additional information on the rankings is provided by age level in 
Table 1.07. Spearman rank-order correlations were computed, using the 
rankings made by the eleven teachers without converting them to standard 
scores. An inspection of the table reveals negative correlations in 
the group of kindergarten subjects. The only significant correlation 
occurs in the junior-high group and it is accompanied by another correla- 
tion in that group which is high and positive. 

Computing the correlations in this manner, even though N's are small, 
provides an opportunity for observation across age levels. In the group 
of kindergarten subjects, for example, the three correlations are negative, 
indicating perhaps the difficulty involved in discerning the student's 
need for approval or identifying those behaviors which suggest need for 
approval. In the group of junior-high students, however, patterns of 
behavior appear to be less random and more established enabling the 
teacher to make better judgments concerning actions showing need for 
approval. This is hinted by the correlations which are, as a group, 
higher than those in other groups. One of the correlations is significant 
beyond the .05 level, while another is relatively high and positive. It 
is of interest that these Vm rankings were made by the same teacher. 

The main findings may be summarized as follows: 

1. What has been termed here physical reinforcement is more 
effective at all age levels than what has been termed verbal reinforce- 
ment. The general appearance of the learning curves indicates that at 
the lower age level the response to physical plus verbal reinforcements 
is almost the same as the response to physical alone. At the higher 
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Table 1 .07 



SPEARMAN RANK-ORDER CORRELATIONS BETWEEN TEACHER'S 
RANKING ON NEED FOR APPROVAL AND RESPONSE 
TO VERBAL STIMULI 



Group 


Rho 


Ki ndergarten 


-.036 


Kindergarten 


-.518 


Kindergarten 


-.352 


Thi rd Grade 


.334 


Third Grade 


.085 


Thi rd Grade 


-.10 


Sixth Grade 


.345 


Sixth Grade 


-.543 


Sixth Grade 


-.026 


Sixth Grade 


.045 


Ninth Grade 


-.096 


Ninth Grade 


.417 


Ninth Grade 


.112 


Ninth Grade 


.694* 



*Signif leant at =05 level. 
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age levels, there is the suggestion of an additive effect of the two 
reinforcing modes, however, there is no significant difference. 

2. Verbal reinforcement is more effective with females than 
males at both the pre-kindergarten and kindergarten age levels. 

3. In comparing the responses of pre-kindergarten Ss with those 

in third grade group, the females showed a significantly greater number 
of responses to verbal reinforcement at the lower age level while there 
was no significant difference in the case of the males. 

4. Rankings by the teacher of the pupils in terms of the extent 
to which they are judged to show need for approval show no significant 
correlations with the response to approval as a reinforcer. 

Discussion 

Perhaps the most general finding of the study is the extraordinarily 
poor effect which approval has as a re inforcer compared with physical 
reinforcement except for girls at the pre-kindergarten level. This find- 
ing runs counter to the conception of reinforcement provided in most 
textbooks on elementary education which generally take the position that 
adult approval is perhaps the most powerful influence on child behavior. 
In the case of girls at the pre-kindergarten level this may be true, but 
it can hardly stand as a generalization. There would seem to be many 
reasons why approval may well be a much weaker reinforcing contingency 
than many writers suppose it to be. Some of these need to be given brief 




cons i derat i on. 



1.33 



First, approving remarks are very common events in the life of most 
children. Teachers can be observed whose stream of behavior in the 
classroom consists of a flow of comments or gestures of approval. The 
commonplace nature of such comments may reduce their value as a rein- 
f orcer . 

Second, since the behavior of pre-school girls is generally more 
acceptable to the mother than is the behavior of pre-school boys one 
might well expect that approval would be less effective for young boys 
than for young girls. 

Third, there is a possibility that expressions of approval toward 
children may be much more satisfying to the adult who gives them than 
to the child to whom they are directed. 

A second matter of interest is the extraordinary effectiveness of 
the physical reinforcement in this study. This effectiveness could be 
accounted for in a number of different ways. 

First, the delivery of the marble by mechanical means is a novel 
event. There is an overwhelming amount of evidence that novel events 
have reinforcing properties on the behavior of a wide range of different 
creatures. The novelty of the event may, in this particular case, 
account for its effectiveness as a re inforcer. 

Second, in the case of the delivery of a marble, the subject has 
the experience of acting directly on the environment and actually being 
the cause of an event. Such a relationship to the environment is con- 
sidered by many psychologists, including Woodworth (1958), to be one 
sought after by most higher organisms. 
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Finally, the point must be made that the highly sophisticated 
interpretations of the effect of reinforcers on child behavior, such as 
those which invoke Freudian concepts, would appear to involve constructs 
which are too remotely related to be of value at this time. A much 
simpler theory accounting for the nature of events that reinforce child 
behavior would appear to be needed. 

Summary 

The relative effectiveness of verbal and physical reinforcements 
were compared as a function of age and of sex in a knob-pressing task. 

The sample was composed of twenty males and twenty females at each of 
five age levels, representing pre-kindergarten, kindergarten and grades 
three, six and nine. A highly significant difference was found between 
the reinforcing treatments with physical being more effective than verbal 
reinforcement at all age levels. These results had been predicted in 
the two oldest age groups, however, hypotheses made for the pre-kindergarten 
and third grade groups were not confirmed. It had been predicted that 
verbal reinforcement would be more effective than physical with the pre- 
kindergarten group and the two reinforcements would be equally effective 
with the third grade group. Giving a verbal reinforcement simultaneously 
with a physical reinforcement resulted in no greater response than did 
the physical reinforcement alone. 

Pre-kindergarten females respond significantly more to verbal rein- 
forcement than did the third grade females, while no difference was 
demonstrated between male _Ss at the two age levels. 
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SECTION I I I 



CHAPTER 2 



COMPARISON OF THE REINFORCING EFFECT OF 
DIFFERENT STIMULI IN A PAIRED 



ASSOCIATE LEARNING TASK 
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Although strong aversive stimuli have been used extensively to con- 
trol the learning of subhuman subjects, relatively little research has 
been undertaken with the stronger forms of this class of stimuli as 
negative reinforcers of human behavior. 

The well-known study by Warden and Aylesworth (1927) provided results 
showing that rats learned a visual discrimination task nine times faster 
when rewarded (food) for correct responses and punished (shock) for in- 
correct responses than when they were rewarded alone. Although the 
results of the latter study have important implications both theoretical 
and practical in character, only within the last few years have there 
been attempts to compare the effects of similar conditions with human 
subjects. Brackbill and O'Hara (1958) attempted a similar experiment 
with kindergarten children to compare the effects of reward (candy) alone 
with the effect of a combination of punishment (taking away of candy). 

The results of the Brackbill and O'Hara study were in the same direction 
as those of Warden and Aylesworth but much less dramatic though statis- 
tically s i gn i f i cant . 

In another study by Penney and Lupton (1961) children of elementary 
school age received 60 trials on a two-choice discrimination problem. 

One group received rewards (candy) only. A second group was punished 
(98db tone) for incorrect responses. A third group was rewarded for 
correct responses and punished for incorrect responses. In this case 
the punishment alone condition led to faster learning than the punishment 
plus reward condition. An odd and unexplained fact about the latter 
study is that reward produced no observable learning effect while the 
conditions involving punishment did. Penney and Lupton explained their 



r 

L 



2.02 m 

results in terms of a drive resulting from the presence of the intense 
tone and the associated frustration. 

The studies which have been cited do not include the use of severe 
punishment such as may be produced by electric shock, which has been 
typically used in animal experiments. The effect of shock on the learn- 
ing of children is quite unknown. The studies also provide no information 
concerning the relative effects of shock as compared with some of the more 
conventionally administered signals such as "wrong" or "right." Knowledge 
concerning the latter is pertinent with respect to the issue of the 
reinforcing value of events which are strictly informational in contrast 
with the reinforcing events which also produce strong approach or 
avoidance behavior. Events which are typically considered as reinforcers 
of human behavior generally have high information content but do not 
produce strong approach or avoidance behavior when used out of context. 
While the word "right" is said to be a powerful reinforcer of behavior 
involving problem-solving situations, it is not likely to produce approach 
or avoidance behavior when it is emitted at random by a machine. In 
contrast, small pieces of candy are likely to produce vigorous approach 
behavior even when they are not contingent upon particular behaviors. 

The difference in the operation of these two classes of reinforcing events 
in the case of human learners has not been explored. 

In the two experiments that follow a comparison is made between three 
reinforcing conditions, the words right or wrong, a tone to indicate either 
correct or incorrect responses, and an electric shock to indicate either 
correct or incorrect responses. One hypothesis was that a reinforcing 
event such as shock, which has a high value for evoking avoidance 
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behavior, when contingent upon incorrect responses will produce more rapid 
learning than the word "wrong," which in turn will produce more rapid 
learning than the tone (which does not ordinarily produce avoidance 
behavior). In addition it was hypothesized that a reinforcer, such as 
shock, which produces avoidance behavior will be much more effective in 
producing learning when it is used as a signal for wrong than when used 
as a signal for right. It was further hypothesized that information given 
regarding right responses will be more effectively utilized than informa- 
tion given regarding incorrect responses. 

In addition, the Children's Manifest Anxiety Scale developed by 
Casteneda (1956) was administered to all subjects. It was hypothesized 
that the high anxiety groups would show less learning than the low anxiety 
groups and that the high anxiety subjects would show less learning under 
experimentally induced anxiety conditions than the low anxiety groups. 



EXPERIMENT I 



Method and Procedure 

In developing a procedure for testing the hypotheses, two major 
assumptions were made: first, a well-established procedure from which 

most of the technical pitfalls have been eliminated should be utilized 
and, second, the task should be one which bears a relationship to learning 
as it commonly occurs in schools. A paired associate learning task 
satisfies the first of these two criteria and it also simulates in many 
important characteristics typical school learning and recitation pro- 
cedures. In representative paired associate learning tasks, the procedure 
involves the following steps: 



1. Both members of the pair are presented and the subject reads 
them or otherwise responds to them. 

2. The first member is presented alone and the subject attempts 
to reproduce the second member of the pair. Reinforcement 
may or may not be immediately given. 

3. The subject is exposed to a new learning trial with both members 
of the pair present. 

This procedure is similar in many important respects to a typical 
recitation procedure which proceeds along the following lines: 

1. The pupil learns that the capital city of Spain is Madrid. 

2. The teacher asks the pupil "What is the capital of Spain?" 

(The teacher may then provide reinforcement with such words as 
"right" or "wrong," but she may skip to the third step). 

3. The teacher repeats "The capital of Spain is Madrid." 

The procedure which has been described permits learning regardless 
of the presence or absence of the reinforcers. Since learning in this 
situation is not dependent upon the occurrence of the reinforcing events, 
it seems desirable to also study the effects of the reinforcers in a 
situation in which learning was dependent upon the information provided 
by the reinforcing contingencies. The second experiment, described later, 
studied the effects of the different reinforcing conditions on a task 
in which learning could take place only as a result of the reinforcing 
events . 

Design 

The study involved three different reinforcing contingencies, an 
electric shock, verbal feedback, and a tone of approximately 400 cycles 
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per second. Reinforcement was provided either for right responses or for 
wrong responses. In addition, the data permitted an analysis of those 
scoring high on the anxiety scale in contrast to those scoring low. Thus, 
the variables formed the basis for a 2 X 2 X 3 factorial design. 

The subjects were randomly assigned to one of six groups: (1) shock- 

right, (2) shock-wrong, (3) tone-right, (4) tone-wrong, (5) oral-right, 

(6) oral-wrong. This placement was accomplished by having each subject 
draw a token from a small box. The first subject had six conditions to 
draw from, the second subject only five conditions, etc. The drawing 
continued, as each subject presented himself, until the tokens rep- 
resenting the six conditions had been depleted. After each group of six 
subjects, the tokens were returned to the box, and the drawing was begun 
again. This continued until six groups of thirty subjects had been filled. 

Tasks 

The tasks used in this study were essentially the same as those 
developed and first used by Ruch (1934). Ruch used the tasks as part of 
a large battery of tests comparing the learning ability of the aged and 
the young. Korchin and Basowitz (1957) used the tasks in a partial 
replication of the Ruch study. Two of the three tasks were also used by 
Korchin anJ Levine (1957) who considered the relationship of anxiety 
to verbal learning. Data compiled by both Ruch and Korchin gave infor- 
mation on how readily the lists were learned by young and old S s . Korchin 
also gave comparative information on the learning of these tasks by normal 
and psychiatric ^s . 

A preliminary run of subjects revealed that Task I was much too easy. 
The pairs were subsequently scrambled to reduce associative strength. 
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This was done in order that shock-right and shock-wrong modes of rein- 
forcement would not result in a highly disproportionate number of shocks 
for shock-right subjects. 



As 


used by Ruch, Task 1 was: 


As scrambled and 
study Task 1 was; 


1 . 


stem-bud 


1 . 


stem-chai r 


2. 


house-vi si t 


2. 


house-bud 


3. 


room-1 i ght 


3. 


room -owl 


4. 


soft-chai r 


4. 


soft-flag 


5. 


tree-flag 


5. 


tree -car 


6. 


nest-owl 


6 . 


nest-1 i ght 


7. 


whi te-pink 


7. 


whi te-vi si t 


8. 


walk-car 


8. 


wa 1 k-pi nk 


9. 


horse-sheep 






10. 


wi nd-brook 







Only eight of the ten original pair were used, as in the Korchin 
studies. The three tasks used in the study, including the modification 
of Task I just shown, are listed in Table 2.01. 

The tasks are presumed to differ in difficulty and novelty as well 
as in the degree to which prior experience influences present learning. 

The method selected for the experiment was paired associate learning. 
The subjects were required to respond to a stimulus word, letter, or 
number combination by anticipating a paired word, letter, or number. The 
pairs were presented in changed order, on successive presentations after 
the second, to prevent serial learning. 

Lists were presented using the memory drum operating at a three-second 
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rate with a three-second blank exposure between the stimulus and response 
presentation. A short time was allowed between tasks, with a one minute 
rest between Task II and the final one. 

TABLE 2.01 

PAIRED ASSOCIATE LEARN I MG TASKS USED IN THE STUDY 



Task I 



1 . 


stem-chai r 


2. 


house-bud 


3. 


room -owl 


4. 


soft-flag 


5. 


tree -car 


6 . 


nest=-l Ight 


7. 


whl te-vl si t 


8. 


walk-pi nk 



Task II 

1 . r X s = q 

2. h X y = j 

3 . b X d = m 

4. a X m = b 

5 . s X q = h 

6. k X u = t 

7 . w X 1 = s 

8 . 1 X b = d 



Task III 

1 . 5x5=11 

2. 3x1=1 

3. 3x3=4 

4. 3x4=2 

5. 5x1 = 7 

6. 6x3 = 5 

7. 2x5 = 8 

8. 2x4=9 



No practice list was used. The subject was presented the list the 
first time through and told he would be required to learn the second word 
of each pair. Responses were recorded for the second and subsequent 
trials . 

The lists were always presented to the subjects in the same order. 
While there would be advantages in counterbalancing the order, there is 
also an advantage in administering the simpler task first, particularly 
in those treatments Involving electric shock. For this reason the order 
of the task is constant, with the simplest of the three tasks given first. 
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Apparatus 

The learning lists were mechanically presented by a Gerbrand's memory 
drum, Model Ml-A. An elongated paper cylinder, on which were typed six 
orders of the eight pairs making up each learning task, was taped over 
each of the three drums arid was used with a large faceplate and sliding 
mask. This arrangement proved superior to the more usual endless paper 
tape presentation method which is often plagued with problems of synch- 
ronization, transport, and alignment. The paper cylinder allowed quick 
and easy changing of tasks and obviated synchronization, transport, and 
alignment difficulties. 

The device used to present the shock reinforcement is shown schemati- 
cally in Figure 2.01. This homemade shock source allowed the use of 
either rapidly pulsing or single-discharge shock stimuli. The former 
mode was used in this study as the young subjects were less fearful of a 
"tickle shock" than of a single condenser-discharge shock of the same 
intensity. The wave form and intensity characteristics of the shock 
stimuli are shown in Fig 2.02. 

The tone used as one reinforcement mode, was provided by a simple 
400 c.p.s. audio signal generator, shown schematically in Figure 2.03. 

This audio generator fed a ten watt audio amplifier, which in turn drove 
one of a pair of PDR-10 Permaflux headphones. The other phone was replaced 
with a small pad of gauze to keep the phone mount from irritating the 
subject's head. The level of the tone was set to provide a loud, but not 
aversive reinforcement condition. 

A third reinforcement mode using verbal reinforcement conditions was 
provided by the experimenter pronouncing the words "right" or "wrong" as 
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Fig, 2.01 Schematic diagram of shock- dev ice. 

Designed by A. B, Cullen. Science . 19^1, 94, 219-220. 
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used as reinforcing agent in the study. Instantaneous peak 
current flow of 4 ml 1 1 iatyiperes through skin of subject 
showing resistance of 25 Kil.. 



(Techtronix Model 585 oscilloscope used to display the wave 
form and to determine its temporal and intensity charac- 
teristics) . 



sec. not used 
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Fig. 2.03 Schematic diagram of fixed- frequency audio oscillator used as tone source. 
Design from Radio Amateur's Handbook, I960, 37, 315. 
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needed to produce the verbal -right or verbal -wrong reinforcement condi- 
tions. These reinforcement conditions were uniform for all subjects as 
a single experimenter handled all the subjects. 

Subjects 

One hundred-eighty fifth and sixth grade elementary school children 
served as subjects for the experiment. These subjects were obtained and 
tested in three locations in Salt Lake City, Utah, during the summer 
vacation months. Approximately equal number of boys (N=92) and girls 
(N=88) were tested . 

Subjects were selected on the basis of their willingness to partici- 
pate in the experiment utilizing paired associate verbal materials in which 
oral, shock, and tone reinforcement modes were used. Each subject was 
paid $1.00 at the completion of the learning session. 

An effort was made to control contamination by asking subjects not 
to tell others about the learning tasks or procedures. How effectively 
this precaution prevented contamination can only be surmised. 

Procedure Puri nq Acaui s i t ion 

As a subject came into the room where the experiment took place, he 
was seated in a chair to the experimenter's left. Both and E faced a 

long table on which the memory drum, the shock apparatus, the signal 
generator and audio amplifier, one of a pair of headphones with its head 
band, a stack of the forms used to keep track of the learning scores and 
a small box containing wooden tokens were situated. The tokens were used 
in the assignment of ^ to a particular treatment. A checklist of the 
items to be included in the explanations and instructions was on the 
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table, but was kept for reference only. Instructions and explanations 
concerning the apparatus and procedure were given in an informal manner, 
modified somewhat in form with each individual, as some S,s showed more 
curiosity about one aspect of the experiment, some about another aspect. 
Enough time was taken with each child so that he had his questions 
clarified regarding the entire procedure. This was necessary as the ages 
of the S^s ranged from 10 years three months to 13 years six months. This 
age range provided a large span as far as sophistication and confidence 
level was concerned. 

The following instructions were given to each subject before Task I; 



"This is one of several experiments being done by people 
from the University of Utah, We are interested in discovering 
which are the best ways to help children learn." 

"This device, called a memory drum, will automatically show 
you what is to be learned. There are three different learning 
tasks. The first one uses common words with which you are very 
familiar. See this little window? When I turn on the drum, 
the drum will revolve and a word will appear right here. Read 
the word aloud so that I can hear it. The word will disappear 
when the drum moves again. The window will be blank for three 
seconds and then the word you saw and read aloud will reappear. 
This time it will have another word with it. Read both words 
aloud and try to remember them as a pair. They belong together 
and you are to say the second one whenever the first of the 
pair appears. There are eight such pairs of words and you will 
have six trials or chances to learn them." 

"On the second time through these eight pairs, when you 
see the first word, read the word aloud. Then say the first 
word and the word that belongs with it before the two words 
come into view on the next turn of the drum." 

"Your score will be the total number of times you give 
the correct response word before it appears with the first word." 

"There are six ways you may be shown that you have given 
the correct or the wrong response. First, you may be given a 
tone in your ear every time you are right, or you may be given 
the tone only when you give wrong words or no word at all. A 
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second way you may be helped to learn these word pairs 
is to be told the word "right" for each correct word, 
or to be told "wrong" when you give the wrong word or 
do not give any reply. The third way you may be helped 
to learn the word pairs is by being given a "tickle-shock" 
on the tips of the fingers of your left hand for each 
right response. Shocks may be given for wrong responses 
also." 



"Let me show you these different ways of helping you 
learn. This is the tone (placing earphone over the left 
ear). The tone is loud, but not so loud as to hurt your 
ear. 0. K., let's take the ear phone off and see what the 
"tickle-shock" feels like. Put your fingers down on these 
two pieces of metal like this. (Experimenter demonstrates). 
These are the shock electrodes. The shock is unpleasant. 

This is how it feels." (Demonstration of shock) 

"You will learn using only one of the different ways, 
not all of them. If you think you would like to take part 
in this experiment, now that you know what it is all about, 
draw a token out of this box (experimenter shakes the tokens 
about in the box, holding it above the ^*s eye-level) and 
let us see which way you will be helped to learn the pairs 
of words ." 

"You drew S. W. That stands for shock-wrong. Now 
remember that each time you do not give the correct word, 
or give no response, you will be given a shock. Let us tape 
the electrodes on your finger-tips and begin." (Appropriate 
comments were given for the other five conditions), 

"Now we will turn on the drum. Watch the little window 
and read the word aloud as it comes into view. (E prompts 
the S as necessary on each of the eight pairs for the first 
viewing trial). Now, on the second time through the list, 
you will receive the shock for each wrong word, or for failure 
to give any word before the paired words appear." 



After each trial of the list, the sliding mask was moved over one 
position, exposing the same eight pairs, but in a changed order. 

All responses were recorded on a scoring sheet on which E underlined 
each correct response and recorded incorrect responses by writing them. 
Failure to respond at all was indicated by a check. 

At the completion of the six trials on task one, the drum's cylinder 
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Mi 



was replaced with a second one which was used to present task two. The 
same procedure, timing, and reinforcement were used. 

At the end of Task I the following instructions were given before 
administering Task Ih 



"You now know how the memory drum works and how we 
keep score. This next list is harder than the first one, 
so a lower score may still be considered a good score. Do 
the best you can." 

"This time you are to learn letters paired like this 
(E shows ^ an example of the nonsense equations, E x G = Z) . 
The E X G part will appear in the window first, just as the 
word did last time. Say aloud, E x G equals, then wait until 
the drum moves and shows you waht E x G does equal . Then 
repeat the whole combination aloud. This is called a nonsense 
equation. You are to learn which letters go together just as 
you learned which words were mates." 

"! will turn the drum on and you read the letters aloud 
so that I can hear you. Ready? (E prompts _S when needed in 
order to get correct response patterning to occur). Now, the 
second time through the list, I will keep your score." (Second 
task is presented). 



At the end of Task II, E gave the following instructions; 



"That is fine. Let us total up your score for that one. 

One more list and that completes the learning part of the 
experiment. This last list is made up of interference equa- 
tions. For example, you will learn something like 2 x 5 = 25 . 

The answers are wrong, but learn them anyhow. Pronounce the 
first item that appears in the window aloud. Then, when the 
entire equation is visible, say both the part in front of and 
after the equal sign. Remember these pairs belong together as 
did the words and number combinations." 

"Ready? (E prompts the subject to help establish response 
pattern as in Task I and Task II)." 

"That is all of the learning tasks. Let us total your score." 

"I would like to know a little more about how you feel 
in various situations. Before you go, would you mind marking 
these two pages of statements for me. (The statements referred 
to here were the items of the Children's Manifest Anxiety 
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Scale), it may be hard to make up your mind, but in each 
case the statement will be mostly, or most often true of 
you, or mostly, or usually false. Mark a T in front of 
those statements that are mostly, or more often true for 
you and an £ for mostly false. When you have finished both 
pages, and a few questions at the bottom of the last sheet, 
you are through. Now, if you will please sign the pay sheet, 
I will give you your dollar." 

"Thank you for helping me. Be sure not to tell other 
children any of the things you have learned. Let them come 
and find out for themselves. If you tell someone else an 
answer, this may damage the usefulness of the experiment 
and we do not want you to have done all this for nothing." 



A number of subjects expressed fear and disinclination to receive 
the shock reinforcement, but drew a token none the less expressing relief 
whenever alternative reinforcing modes were drawn. No subject actually 
refused to participate after he had presented himself at the experimental 
room. However, as word of mouth advertising was used to secure subjects, 
extremely fearful subjects may have failed to appear at all. The experi- 
menter asked each subject to provide one additional subject. Most subjects 
complied with this request. 



Results 



All scores for performance during acquisition are the number of 
correct anticipations during six trials. Tables 2.02, 2.03, 2.04 and 
2.05 give the means and the standard deviations for the various subgroups 
for each of the tasks and for the total scores derived from each task. 

The performance of the various subgroups gives the appearance of high 
homogeniety from treatment to treatment. The large differences found 
in studies with animals which compare electric shock with nonaversive 
reinforcers certainly 




are not apparent. 
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The distribution of anxiety scores for the 180 S^s who served in this 
study is shown in Figure 2.04. The distribution of the scores in this 
study is very similar to that derived from Castaneda's (1956) original 
standardization data also shown in the same Figure. The mean of our 
sample appears to be a little higher. 

Frequency polygons showing the distribution of CMAS scores for boys 
and for girls appears in Figure 2.05. The distribution for the sexes 
indicates a somewhat higher mean value for the girls. Castaneda (1956) 
also reported a general tendency for the girls to receive higher anxiety 
scores . 

Only one form of the CMAS is available. In order to provide an 
indication of its i nternal -consi stency , Kuder and Richardson's formula 
KR21 was applied. This formula provides a coefficient of equivalence 
not unlike that derived by split-half techniques often used in determining 
i nterna 1 -consi stency measures when equivalent forms of a test are not 
available. Cronbach (I960) has discussed the usefulness as well as the 
limits of this Kuder-Richardson formula. In the present consideration of 
the CMAS, the derived coefficient of equivalence was found to be .75. 

This closely approximates reported test-retest reliabilities of the CMAS 
which Smock (1958) reports to range from .61 to .65, depending on the age 
and sex of the children to whom it has been given. Smock (1958), citing 
an unpublished study, reported that with seventh and eighth grade subjects 
he found slightly higher test-retest reliability than those noted above. 

The relationship of anxiety to learning is apparently a. complex one. 
In studies by Matarazzo and Phillips (1955) and Matarzaao et al., (1955), 
a curvilinear relationship was postulated as existing between performance 
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Midpoint of Score Interval 

Fig. 2.04 Frequency polygon showing percentage of 180 children re- 
ceiving the indicated scores on the children's Form of the Manifest 
Anxiety Scale. 
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Fig, 2.05 Distribution of Children's Manifest Anxiety Scale score 
for boys and for girls. 
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in learning situations and anxiety as measured by a questionnaire. 

The relationship was demonstrated as significant for timed maze learning 
but not for digit symbol learning. 

A curvilinear relationship might be expected in terms of what is 
known about drive in relation to learning. As drive is increased, rate 
of learning also tends to increase; but as drive level exceeds a certain 
point performance becomes disorganized and declines. 

The anxiety scores of the subjects in this study, when plotted on a 
seven-point scale (shown in Figure 2.06) and compared to learning for 
each of the three tasks as well as for total learning, failed to show any 
such curvilinear relationship. This provides a sound basis for the 
rejection of the hypothesis which postulated that such a curvilinear 
relationship would be found existing between learning on the tasks and 
levels of anxi ety . 

The lack of a consistent relationship between anxiety and learning 
is emphasized by the correlations shown in Table 2 .06. For a correlation 
to be significant at the .05 level, r must be 0.35 when N=30 . Inspection 
of Table 2.03 shows that only the correlation of CMAS scores and learning 
with shock reinforcement for right responses on Task I reaches this level. 
Since one correlation in 20 could be expected to reach this magnitude by 
chance, one must conclude that the relationship between CMAS and task 
performance does not reach a significant size. 

A number of different analyses of variance may be used to advantage 
to bring out relevant relationships. What may have been an obscure or 
insignificant relationship in one analysis may emerge as clear-cut and 
significant when the variables are analyzed using a different breakdown 



of the data . 
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Table 2.06 

CORRELATIONS OF CMAS WITH NUMBER OF CORRECT ANTICIPATIONS IN THE 
THREE EXPERIMENTAL TASKS. CORRELATIONS ARE REPORTED 
SEPARATELY FOR THE SIX EXPERIMENTAL CONDITIONS.* 





Task 1 


Task II 


Task 1 1 1 


Task IV 


Oral Wrong 


-.21 


.11 


- .06 


-.08 


Shock Wrong 


..00 


-.12 


-.17 


-.11 


Tone Right 


.01 


-.23 


-.02 


-.09 


Oral Right 


.12 


-.07 


-.01 


.03 


Shock Right 


-.35 


-.15 


-.21 


-.29 


Tone Wrong 


.23 


-.14 


-.09 


.03 



*N=30, r required for significance at .05 level = 0.35 

A drawing showing the variables and their relationship is presented 
in Figure 2.07. The three columns represent reinforcement modes — oral, 
shock, and tone. The two rows include reinforcement of right responses 
and reinforcement of wrong responses. The layers consist of high and low 
anxiety levels. 

An analysis of variance for these variables using total learning scores 
is summarized in Table 2.07. The variable involved was the number of cor- 
rect responses over all learning trials for all tasks. This analysis 
points to a significant interaction effect of reinforcing of right and 
wrong responses on the one hand and reinforcement mode on the other, but 
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TABLE 2.07 

ANALYSIS OF VARIANCE FOR TOTAL NUMBER OF CORRECT RESPONSES 

TO ALL TASKS IN EXPERIMENT I . 






Source 


Sums of 
Squares 


df 


Mean Square 


F 


P 



Right-Wrong 


67.22 


1 


67.22 


Reinforcement Modes 


1134.18 


2 


567.09 


High-Low Anxiety 


583.12 


1 


583.12 


Right-Wrong 
Reinforcement Modes 


2708.04 


2 


1354.02 


Right-Wrong and 
High-Low Anxiety 


86.27 


1 


86.27 


Reinforcement Mode 
and High-Low Anxiety 


1362.81 


2 


680 .90 


Right-Wrong 
Reinforcement Mode 
and High-Low Anxiety 


330.40 


2 


165. 2 


Wi thi n Cells 


63875 .87 


168 


380.21 


Total 


70146.91 


179 





reference to Table 2.01 does little to clarify the nature of this inter- 
action and raises the question whether it is anything more than an oddity 

in this experiment. Anxiety does not appear as a significant main effect 

* ‘ 

in this analysis, and neither does it enter into a significant interaction. 

In the analysis as summarized, possible differences due to inter- 
task difficulty level were obscured by considering only total learning 
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for the three tasks. In order to further probe for significant inter- 
actions and differences, three additional analyses of variance were under- 
taken using a learning score derived from one task at a time. These 
analyses are shown in Tables 2.08, 2.09, snd 2.10. The same 2x2x3 
factorial design is involved, but learning is considered for each task 
separately rather than for the scores of all tasks added together. 

Table 2.08 shows oral, shock, and tone reinforcement methods to 
differ significantly in their roles as reinforcing agents. This was to 
be expected in terms of studies involving subhuman species. However, the 
same main effect is not found in the analyses of variance for Task II and 
III, a fact which minimizes any Importance which can be attributed to 
the finding with respect to Task I. 

A second effect which can be noted from the analysis shown In 
Table 2.08 is the significant Interaction between anxiety levels and 
reinforcing modes. Not only do the modes differ significantly but when 
anxiety and modes interact, the Interaction Is also significant. This 
interaction appears to represent the fact that the high anxious group 
was particularly affected by electric shock and in a negative direction. 

Table 2.09 shows that in Task II, learning of non-mean I ngful equa- 
tions, a single significant interaction between response modes and 
reinforcement of right-wrong responses emerges. At this point it is 
appropriate to recall that Task II proved to be much the hardest of the 
tasks, the means being: Task I, 26.1; Task II, 17*7; Task III, 26.7. 

This high level of task difficulty may account for differences in inter- 
actions shown to be significant for Task I and not for Task II . Thus, 
anxiety had no significant effect on learning of Task II. 
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TABLE 2 .08 

ANALYSIS OF VARIANCE FOR TASK I IN EXPERIMENT I. 



Source 


Sums of 
Squares 


df 


Mean Square 


F 


P 


Anxiety 


1 .07 


1 


1 .07 






Ri ght-Wrong 


1 .96 


1 


1 .96 






Reinforcement Modes 


123 .47 


2 


61 .74 


4 .24 


.05 


Anxiety and 
Ri ght-Wrong 


.30 


1 


.30 






Anxiety and 
Reinforcement Modes 


128.36 


2 


64.18 


4.40 


.05 


Right-Wrong and 
Reinforcement Modes 


23.79 


2 


11 .90 






Anxiety, Right- 
Wrong and Reinforce- 
ment Modes 


46.16 


2 


23 .08 






Wi thi n Cel 1 s 


2446 .93 


168 


14.57 
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TABLE 2.09 

ANALYSIS OF VARIANCE FOR TASK II IN EXPERIMENT I. 



Source 


Sums of 
Squares 


df 


Mean Square 


F P 


Anxiety 


20.83 


1 


20.83 




Ri ght-Wrong 


.01 


1 


.01 




Reinforcement Modes 


6.79 


2 


3.39 




Anxiety and 
Ri ght-Wrong 


.37 


1 


.37 




Anxiety and 
Reinforcement Modes 


13.51 


2 


6.76 




Right-Wrong and 
Reinforcement Modes 


95.05 


2 


47.52 


5.25 .05 

* • 


Anxiety, Right- 
Wrong and Reinforce- 
ment Modes 


1 .18 


2 


.58 




Within Cells 


1520.68 


168 


9.05 
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TABLE 2.10 

ANALYSIS OF VARIANCE FOR TASK III IN EXPERIMENT I. 





Source 


Sums of 
Squares 


df 


Mean Square 


F 


P 



AnxI ety 


18.14 


1 


18.14 


Ri ght-Wrong 


3.33 


1 


3.33 


Reinforcement Modes 


3.36 


2 


1 .68 


Anxiety and 
Ri ght-Wrong 


4.54 


1 


4.54 


Anxiety and 
Reinforcement Modes 


9.42 


2 


4.71 


Right-Wrong and 
Reinforcement Modes 


13.31 


2 


6 .66 


Anxiety, Right- 
Wrong and Reinforce- 
ment Modes 


46.55 


2 


23.28 


Within Cells 


1717.22 


10.22 
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Turning to the analysis of variance shown in Table 2.10, which 
provides an analysis of the data for the third task, no significant 
differences or interactions appear among the variables under observation. 

In the previous analysis of variance, scores derived fron all trials 
were summed to produce a total learning score. Some advantage is to be 
gained by using the separate scores from each trial since this permits 
the exercising of control over one source of variance which has not been 
controlled in previous analyses. Three additional analyses of variance 
were undertaken which involved between trials variance. These are shown 
for the three tasks in Tables 2.11, 2.12 and 2.13. 

Consider first the analysis for the data derived from Task I shown 
on Table 2.11. The between trials variance Is clearly and obviously 
significant as It would be expected to be. This Is hardly surprising 
for It simply means that learning has taken place. Two interactions, 
significant at the five percent level should be noted. First, the trials 
by anxiety by right-wrong reinforcement Interaction comes out clearly. 
Second, there is a significant Interaction of trials by anxiety by mode 
(oral-shock-tone). This Is fairly evident In the data which shows that 
the shock mode of reinforcement Is the one In which the high anxiety sub- 
jects have a noticeable depression of performance. 
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TABLE 2.11 

ANALYSIS OF VARIANCE FOR TRIALS LEARNING DATA 
FOR TASK I IN EXPERIMENT I 



Source of 
Variation 


Sums of 
Squares 


df 


Mean 

Square 


F 


P 


Trials 


1443 .4 


5 


288.68 


248.8 


.001 


Trials by anxiety 


4.51 


5 


.90 






Trials by right- 
wrong rei nforcement 


9.09 


5 


1 .82 






Trials by rein- 
forcing modes 


7.24 


10 


.72 






Trials by anxiety 
by right-wrong 


13.01 


5 


2.60 


2.24 


.05 


Trials by anxiety 
by reinforcing modes 


22.39 


10 


2.24 


1 .93 


.05 


Trials by right- 
wrong by reinforcing 
modes 


8.84 


10 


.88 






Trials by anxiety 
by right-wrong by 
reinforcing modes 


15.93 


10 


1.59 


1 .30 




Residue 1 


973 .60 


840 


1 .16 


1 .16 
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Table 2.12 presents an analysis of variance of trials learning data 
for Task II. This analysis was unproductive of significant interactions, 
though the main effects of trials was, of course, clearly significant. 

The analysis of variance for trials learning in Task 111 is shown in 
Table 2.13. Here are found two further interactions which closely approach 
significance. Both of these involve the anxiety variable. These triple 
interactions are (1) trials by right-wrong reinforcement and (2) trials 
by anxiety by reinforcing modes (oral, shock, and tone). The fact that 
both of these triple interactions involve a trials term indicates that 
the effect of anxiety is not uniform over the learning procedure. 

Lie Scale Scores and Learni nq 

Although anxiety did not emerge as a significant main effect in the 
study, lie scale scores showed a consistent negative correlation with 
total learning. Correlations for lie score and total learning for the 
treatment conditions are shown in Table 2.14. The results agree with 
those previously reported by Axelrod, Cowen, and Heilizer (1956). 
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TABLE 2.12 

ANALYSIS OF VARIANCE FOR TRIALS LEARNING DATA FOR 
TASK II IN EXPERIMENT I . 



Source of 


Sums of 




Mean 






Variation 


Squares 


df 


Square 


F 


P 


Trials 


610.42 


5 


122.08 


95.22 


.001 


Trials by anxiety 


4.70 


5 


..94 






Trials by right- 
wrong reinforcement 


4.80 


5 


.96 






Trials by reinforc- 
i ng modes 


9.43 


10 


.94 






Trials by anxiety 
by right-wrong 


8.36 


5 


1 .67 


1 .30 


(2.22= 


Trials by anxiety 
by reinforcing 
modes 


7.55 


10 


.76 






Trials by right- 
wrong by rein- 
forcing modes 


12.33 


10 


1 .23 


.96 




Trials by anxiety 


1 1 .22 


10 


1.12 


.85 





by right-wrong by 
reinforcing modes 



1 .28 



Residual 



1077.17 840 
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TABLE 2.13 

ANALYSIS OF VARIANCE FOR TRIALS LEARNING DATA FOR 
TASK I IN EXPERIMENT I . 



Source of 
Variation 


Sums of 
Squares 


df 


Mean 

Square 


F 


P 


Trials 


806.76 


5 


161 .35 


131 .17 


.001 


Trials by anxiety 


7.70 


5 


1 .54 


1 .25 




Trials by right- 

wrong reinforcement 7*10 


5 


1 .42 


1.15 




Trials by rein- 
forcing modes 


21 .99 


10 


2.20 


1 .78 


(1 .84=. 05) 


Trials by anxiety 
by right-wrong 


6.98 


5 


1 .40 


1.13 




Trials by anxiety by 
reinforcing modes 8.66 


10 


.87 






Trials b> right- 
wrong by reinforc- 
ing modes 


12.85 


10 


1 .29 


1 .04 




Trials by anxiety by 
right-wrong by re- 
inforcing modes 21.88 


10 


2.19 


1 .78 


(1 .84=05) 


Residua 1 


1035.03 


840 


1 .23 
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TABLE 2.14 

RELATIONSHIP OF LIE SCORE ON THE ANXIETY SCALE TO TOTAL 
PERFORMANCE ON THE THREE TASKS (N=30, p=0.05 for r=0.35) 



Reinforcing Condition Correlation 



Oral wrong 


-0.45 


Shock wrong 


-0.31 


Tone wrong 


-0.02 


Oral right 


-0.31 


Shock right 


-0.22 


Tone right 


-0.43 








EXPERIMENT 2 
Method and Procedure 

In the previous experiment, subjects could learn each task without 
the reinforcement following each trial. In the second experiment the 
tasks were so modified that the reinforcing events were necessary for 
learning. In order for this to be the case, the response to each stimulus 
term was modified so that it came from a restricted number of categories. 

In the case of Task I the correct response to the stimulus word was either 
the word "long" or the word "short." On Task II the correct response was 
either A or B and on Task III either 1 or 2. Subjects were told what the 
alternatives were before each task and at no time was the stimulus presented 
together with the correct response on the memory drum. 

Design 

The design was essentially the same as in the previous experiment 
with either right responses or wrong responses reinforced, with three 
modes of reinforcement (shock, oral, tone), two levels of anxiety, and 
also two levels of difficulty of the tasks. 

Task 

The task involved the learning of a correct response to a series of 
stimuli. The directions were essentially the same as those used in 
experiment 1 except that the subject was told in the first task that the 
answer was always either the word "long" or the word "short." In the case 
of the second task the response to each stimulus was either "a" or "b" 



with only one being correct. In the case of the third task the correct 
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response to each stimulus was either 1 or 2. After each stimulus^ the 
subject guessed which one of the responses was correct and received feed- 
back in accordance with the treatment to which the particular subject had 
been assigned. In contrast with the previous task the subjects in experi- 
ment 2 could learn only through the feedback provided since a correct and 
complete presentation of the stimulus and response did not follow each 
presentation. All subjects completed all three tasks in the same order. 
Each subject was run for six trials on each task and his score was the 
number of correct responses over all six trials. 

Tasks 

The tasks used in experiment 2 are shown in Table 2.15. Each task 
was used in a long and a short form. 

* 

Subjects 

The subjects were 192 boys and girls with an age range closely similar 
to those in the previous study. They also were obtained In a residential 
neighborhood and the experiment was conducted In the basement of one of the 
houses. Subjects were assigned to experimental treatments by drawing a tag 
from a hat. The tag Indicated the treatment to which they were assigned. 

Resu 1 ts 

Since In this second experiment, learning cannot occur without the 
signals or reInforcers, one might have expected substantial differences 
between the main effects. Tables 2.16, 2.17> 2.18 and 2.19 show the 
means and standard deviation for the groups representing the various 
treatments. The tables are presented by task and also for total per- 



formance on the three tasks. 
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TABLE 2.15 

TASKS USED IN EXPERIMENT 2. 



Task I Task I I 

Stimulus Response Stimulus Response 



Task I I I 

Stimulus Response 



stem 


long 


r 


X 


s = 


a 


5 


X 


5 


= 


1 


house 


short 


b 


X 


y = 


b 


''3 


X 


1 


= 


2 


room 


long 


b 


X 


d = 


b 


3 


X 


3 


= 


2 


*soft 


long 


a 


X 


m = 


b 


''3 


X 


4 


= 


1 


tree 


short 


s 


X 


q = 


a 


5 


X 


1 


= 


1 


nest 


long 


*k 


X 


u = 


a 


6 


X 


3 


= 


2 


‘'<wh i te 


short 




X 


1 = 


b 


2 


X 


5 


= 


1 


walk 


short 


1 


X 


b = 


a 


2 


X 


4 


= 


2 



*These items were omitted to produce the shorter task. 

The means do not show any particularly striking differences between 
reinforcing modes such as has appeared in data involving rats. Indeed, 
the differences are very small and inconsistent from task to task. The 
table of means do suggest that there may be differences between the high 
and low anxiety groups. While the differences between the means are small, 
they are consistent from task to task. 

In addition, there appears to be a small but consistent difference 
between the learning of right and the learning of wrong responses with 
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the reinforcement of wrong responses producing consistently slightly 
better learning. 

While the table of means and standard deviations is suggestive of 
the effects of differences in treatments, the testing of the hypotheses 
implicit in the design require the use of analysis of variance. The 
analyses of variance corresponding to the table of means and standard 
deviations are shown in Table 2.20, 2.21, 2.22, and 2.23. These tables 
present separately the analyses of variance for each task and for the scores 
derived from ail three tasks combined. 

The F tests do not provide a very impressive array of significant 
differences. The most marked is that related to whether right or wrong 
responses are reinforced. While the direction of this difference is the 
same across ail three tasks, the effect is seen mainly with Task I. From 
the data one cannot tell whether this is a result of the nature of the 
task or whether it is due to the fact that the particular task was given 
first and that during this task the subject underwent adaptation to the 
learning situation with a resulting change in response to subsequent tasks. 

The F test related to the long-short task comparisons has not been 
computed since it is of trivial interest. The long task - short task 
comparison is only of interest in the interactions where it did not 
produce any significant F. 



Discussion 

When the results of the study as a whole are considered, one cannot 
help being impressed with the contrast between the data provided by 
children in the shock situation with the data derived from studies with 
rats. While rats learn at a vastly faster rate when shocked for incorrect 
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TABLE 2.20 



ANALYSIS OF VARIANCE FOR ALL TASKS COMBINED FOR 
NUMBER OF CORRECT RESPONSES IN EXPERIMENT 2 



Source of 
Va r i a t i on 


Sums of 
Squares 


df 


Mean 

Square F P 


A (treatment) 


558.78 


2 


279.39 


B (anxiety) 


796.26 


1 


796.26 


C (long-short) 


71804.01 


1 


71804.01 


D (right-wrong) 


1982.76 


1 


1982.76 4.14 <.05 


AB 


720.82 


2 


360.41 


AC 


1270.82 


2 


635 .41 


AD 


254.76 


2 


127.38 


BC 


6.37 


1 


6.37 


BD 


497.29 


1 


497.29 


CD 


218.87 


1 


218.87 


ABC 


851 .08 


2 


425.54 


ABD 


381 .78 


2 


190.89 


ACD 


65.39 


2 


32.70 


BCD 


174.44 


1 


174.44 


ABCD 


2121 .83 


2 


1060.92 


Wi thi n 


80439.38 


168 


478.81 


Tota 1 


162665.46 
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TABLE 2.21 



ANALYSIS 


OF VARIANCE FOR 
RESPONSES 


TASK 1 FOR NUMBER OF 
IN EXPERIMENT 2. 


CORRECT 


Source 


Sums of 


df 


Mean 




of Variation 


Squares 




Square 


F P 


A (treatment) 


140.32 


2 


70.2 




B (anxiety) 


368.52 


1 


368.52 


3.35 <M0 


C (long-short) 


8829.19 


1 


8829.19 




D (right-wrong) 


918.75 


1 


918.75 


8.35 <.001 


AB 


19.20 


2 


9.60 




AC 


86.16 


2 


43.08 




AD 


54.03 


2 


27.02 




BC 


17.52 


1 


17.52 




BD 


320.33 


1 


320.33 


2.91 .05 


CD 


90.75 


1 


90.75 




ABC 


12.20 


2 


6.10 




ABD 


333.82 


2 


166.91 




ACD 


114.40 


2 


57.20 




BCD 


126.75 


1 


126.75 




ABCD 


220.28 


2 


110.14 




Wi thin 


18477.75 


168 


110.00 




Tota 1 


30129.98 
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TABLE 2.22 

ANALYSIS OF VARIANCE FOR TASK II FOR NUMBER OF 
CORRECT RESPONSES IN EXPERIMENT 2 



Source of 
Variation 


Sums of 
Squares 


df 


Mea n 

Square p p 


A (treatment) 


137.89 


2 


68.95 


B (anxiety) 


17.52 


1 


17.52 


C (long-short) 


12033.33 


1 


12033.33 


D (right-wrong) 


63.02 


1 


63.02 


AB 


69.51 


2 


34.76 


AC 


83.32 


2 


41 .66 


AD 


20.14 


2 


10.07 


BC 


4.69 


1 


4.69 


BD 


30.08 


1 


30.08 


CD 


.19 


1 


.19 


ABC 


228,66 


2 


114.33 


ABD 


197.89 


2 


98.95 


ACD 


250.91 


1 


250.91 3.57 . <.il 0 


BCD 


120.33 


2 


60.17 


ABCD 


286.95 


2 


143 .48 


Within 


11810.25 


168 


70.30 


Tota 1 


25354.67 
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responses, the children do not. While one might have expected that in 
the shock-wrong condition the children would have learned more rapidly in 
order to escape from the shock--and this would appear to be a ready means 
of escape — the fact was that this group did not perform better than the 
groups which received no shock. In addition, the group that received 
shock for correct answers was not particularly slowed down by this pro- 
cedure despite the fact that the better they learned the more times they 
received shock. The latter group was aware of the fact that the task was 
to be continued through six trials even if they gave a perfect performance 
on earlier trials. 

A second point to note is that any significant differences which did 
appear tended to appear on the first of the three tasks. Since the tasks 
were not administered in a counterbalanced order one cannot tell whether 
the effect is a result of the fact that the task yielding significant 
differences appeared first or whether it was the simplest task. Indirect 
evidence suggests that the effect was probably due to the fact that the 
task appeared first and that the significant differences were a result of 
the difficulties which the subjects had in adjusting to the conditions of 
learning. Certainly, even those who have worked with rats report that 
the animals become adapted to electric shock. In this particular study, 

E had the impression that the children showed a similar tendency to adapt. 
A general conclusion indicated by the data is that the reinforcing value 
of an event for the subjects considered here is the amount of information 
which it contains. 

in the present study, the reinforcers contingent upon incorrect 
responses provided the same amount of information as the reinforcers 
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contingent upon correct responses. In the second experiment a significant 
difference was found between the rate of acquisition when wrong responses 
were reinforced in contrast to the reinforcement of right responses. The 
direction of this trend is opposite to that found in some studies of con- 
cept learning, as for example that of Bruner ejt (1956), which have 
reported that subjects derived more information from the positive reinforce 
ment of correct responses than from the negative reinforcement of incorrect 
responses. If a similar finding were to occur in other studies of acquisi- 
tion it would raise an important question for those designing programmed 
materials who have commonly taken the position that the program should be 
arranged so that the subject makes only correct responses. Human subjects 
appear to be much more capable of learning from their errors than some 
psychologists have supposed them to be. 
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Although the studies considered in this section of the report 
provide a wide range of reinforcing events with greatly varying charac- 
teristics, the results do not indicate a strong relationship between the 
nature of the reinforcing events and rate of acquisition of the particu- 
lar task involved. This conclusion holds even when there were drammat ic 
differences in the reinforcing events as was the case when electric 
shock was compared with verbal approval indicated by the word "Right." 

The reinforcers used in the studies served two functions. First, 
they provided information concerning a correct or appropriate answer. 

The different reinforcing contingencies provided equal amounts of infor- 
mation within any one experiment and, hence, comparisons could be made 
in terms of the other characteristics of the reinforcing event. The 
second characteristic of the reinforcing event which is commonly con- 
sidered to have special significance is the capacity it has of producing 
affect. Although the informational property of a reinforcing event can 
be readily assessed, its capacity to produce affective responses is not 
readily determined. The evidence that a reinforcing event has strong 
affective properties is generally qualitative and verbal and is reflected 
by such statements as "I don't like that," "Do you have to do that again? 
or "That's fun." Other evidence that events have affective properties 
are shown in the tendency to avoid further events in the same category 
or to seek out repetition of the events. One could also conceive of 
obtaining evidence of affect iv* r, e llirough records of visceral 

phenomenon such as blood pressure, sweating, changes in peristalsis and 
so forth, but such evidences are difficult to collect. In the case of 
the studies that have been reported, it seems clear that 



o 



the electric 



shock used produces strong negative affective response while the word 
"Right” might be expected to produce mild positive affect. Though 
these effects cannot be considered as in equal and opposite directions 
they can be presumed to be sufficiently disparate for our purposes. 

The only striking differences between the reinforcing properties 
of events are found in the first of the two studies reported in which 
it was shown that a physical event, involving the delivery of a marble, 
had greater reinforcing value than approval when all age groups involved 
were combined. A second finding was that for the very young children 
the girls showed a more marked tendency to respond to approval than 
girls in the older age groups. On the other hand, in the study involv- 
ing electric shock in which the aversive property of this reinforcing 
event was beyond question there were no notable differences in rate of 
acquisition in treatments involving this reinforcing event than in treat- 
ments involving events which might be considered to produce positive 
affect. These results are in marked contrast with those derived from 
animal experiments in which the introduction of shock has been shown to 
have dr am at ic effects on the rate of acquisition both in motor learning 
and perceptual discrimination tasks. The data presented here implies 
that affective responses associated with reinforcers have vastly greater 
influence on the learning process in subhuman than in human subjects. 

In the learning situations considered here, the human responds to the 
information supplied rather than to other characteristics of the rein- 
forcing event. One suspects that the human would learn more effectively 
from reinforcers carrying a large amount of information but producing 
negative affect than from reinforcers providing much less information 
but producing positive affect. 
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The implication of these studies for education is that human learn- 
ing should generally be planned in such a way that the reinforcing events 
supply information in the clearest and most unambiguous form. Informa- 
tion hidden in a cloak of words of praise or blame may loose part of 
its impact. Indeed, part of the information may be lost when it is thus 
encapsulated. 







